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Dear Sir: 

I, Dr. Nils Lonberg, declare the following: 

1 . I, Dr. Nils Lonberg, am presently the Senior Vice President and Scientific Director at 
Medarex, Inc. in Milpitas, California, the assignee of the above-referenced patent application. I 
received a Ph.D. in Biochemistry and Molecular Biology from Harvard University and 
completed a Post-Doctoral Fellowship at Memorial Sloan-Kettering Cancer Center in New York, 
New York. My curriculum vitae is attached herewith as Appendix A. 

2. I have reviewed claims 64-66 of the above-referenced application which are drawn to an 
isolated human monoclonal anti-human CD89 antibody, or antigen binding portion thereof, 
comprising a heavy chain variable region derived from a human germline Vh 3-30.3 gene and a 
light chain variable region derived from either a human germline V K LI 8 gene or V K A27 gene. 

3. I understand that claims 64-66 of the above-referenced application have been rejected as 
being indefinite. Specifically, the Examiner asserts that claims 64-66 are indefinite based on 
reference to the V H 3-30.3, V K LI 8, and V K A27 germline genes "because the characteristics of 
these genes are not known." 

4. It is my opinion that, prior to the filing date of the present application, the meaning of the 
above-mentioned human germline genes would have been clear and definite to one of ordinary 
skill in the art, including the specific characteristics {e.g., sequences) of these genes. In 
particular, as evidenced by the enclosed references (discussed in detail below), not only was the 


Application No.: 10/073644 


Docket No.: MXI-211 


nomenclature of these genes well established, accepted and known in the art by the filing date of 
the application, but also they had been mapped and sequenced in their entirety. In addition, as 
also evidenced by the enclosed references, each of these Vh 3-30.3, V K LI 8, and V K A27 
designations corresponds to a single gene (i.e., a single allele), the full-length sequence of which 
was known and publicly available at the filing date of the present application. As such, use of 
these gene designations in the claims of the present application would have been clear and 
definite to one of ordinary skill in the art. 

5. With respect to the human light chain germline genes, Vk LI 8 and Vk A27, 1 refer to the 
following scientific review entitled "Immunoglobulin Genes," Second Ed., (1995) edited by T. 
Honjo and F.W. Alt, Academic Press, which summarizes the knowledge in the art as of 1995 
with respect to the organization, structure and nomenclature of the light chain immunoglobulin 
genes. In particular, chapter 8, entitled "The human immunoglobulin k genes" by Hans Zachau, 
discusses the human kappa locus (see pages 173-191; attached herewith as Appendix B). As 
described in Appendix B, the human Vk genes were first isolated and sequenced as early as 
1980. An outline of the human kappa locus is provided in Figure 2 (page 175) and shows that 
the k proteins have been classified into four subgroups, I-IV with the Vk LI 8 gene and Vk A27 
genes classified in subgroups I and III, respectively. Moreover, as also shown in Figure 2, these 
genes correspond to a single allele. Moreover, the complete sequences for these genes had been 
published, for example, by Schable and Zachau (1993) Biol. Chem. 374:1001-1022 (attached 
herewith as Appendix C; see page 1020 providing the amino acid sequences of Vk LI 8 and Vk 
A27). 

6. With respect to the human heavy chain germline gene Vh 3-30.3, 1 refer again to the 
Honjo review. In particular, chapter 7, entitled "Immunoglobulin heavy chain loci of mouse and 
human" by Tasuku Honjo and Fumihiko Matsuda, discusses the nomenclature of the human V H 
locus (see pages 145-171 ; attached herewith as Appendix D). The human Vh regions have been 
divided into three subgroups based on amino acid sequence homology. These three subgroups 
have further been divided into six V H families (see Figure 1, page 147). Subgroup III contains 
the largest number of members, yet constitutes a single family, Vh3. Based on the established 
nomenclature, each Vh segment is named according to its family number and the order from the 
3* end of the V H locus. An insertional polymorphic Vh segment is indicated by a number with a 
decimal point (see, page 149, lines 22-27). Accordingly, the V H 3-30.3 gene refers to a specific 
polymorphic variant of the Vh 3 family member which is the thirtieth gene from the D region. 
In addition, the complete amino acid sequence of allele 3 of Vh 3-30, i.e., Vh 3-30.3, had been 
determined and published prior to the filing date of the present application by, for example, 
Chang and Siegel (1998) Am. Soc. Hematol 21(8)3066-3078 (attached herewith as Appendix E; 
see Figure 2(a)). 

7. In conclusion, as shown by the foregoing pre-filing publications, the light and heavy 
chain immunoglobulin germline gene nomenclature recited in the claims of the present 
application was well known and accepted in the art prior to the filing date of the present 
application. Moreover, these gene designations (Vh 3-30.3, Vk L18, and V K A27) were known 
to correspond to a single gene sequence (i.e., a single allele) which was also publicly available 
prior to the filing date of the application. Accordingly, reference to these gene designations in 
the present claims would have been clear and definite to one of ordinary skill in the art at the 
time of filing. 
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9. I have been warned that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under §1001 of Title 18 or the United States Code, and that such 
willful and false statements may jeopardize the validity of the subject application or any patent 
resulting therefrom, and declare that all statements made of our own knowledge are true and that 
all statements made on information and belief are believed to be true. 



Date: 


10 - / 3>~0$^~ 
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Curriculum Vitae 


Work Address: 

Telephone: 
Internet E-mail: 


Nils Lonberg 

2-23-56, Berkeley, CA 

227-92-3993 

168 Bardet Rd 
Woodside, CA 94062 

Medarex 

521 Cottonwood Drive 
Milpitas, CA 95035 

home: (650)851-1442 
work: (408)545-2721 

nlonberg@medarex.com 


Education: 

Undergraduate: 

Graduate: 
Postdoctoral: 


1974-1977 

1977- 1978 

1978- 1979 

1979- 1985 
1985-1989 


Reed College 
University of Oregon 
Reed College 

Harvard University, Ph.D v March 1985 
Sloan-Kettering Institute for Cancer Research 


Employment: 1990-1994 


1994-1997 


1998-2000 


Senior Scientist 

GenPharm International, Mountain View, CA. 

Director, Molecular Biology 
GenPharm International, Palo Alto, CA. 

Vice President, Scientific Director 
GenPharm International, San Jose, CA. 


2000-present 


Senior Vice President, Scientific Director 
Medarex, Milpitas, CA. 
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The Variable Genes of the Human Immunoglobulin x Locus 

KarlheinzF. Schable and HansjG. Zachau 
Institut fur Physiologische Chemie der UniversitSt Miinchen 

(Received 23 August 1993) 


Human immunoglobulin genes of the x type have 
been studied intensively during the past several years. 
Since the early work of Bentley and Rabbitts* 1,21 a 
number of laboratories have contributed to the 
knowledge of the immunoglobulin x genes and their 
function. About a dozen years ago, we embarked on 
a systematic study of the structure of the x locus, the 
mechanism of Y,-J x rearrangements and the expres- 
sion of the rearranged genes. The work was reviewed 
in regular intervals 13 " 51 . Now that our studies are draw- 
ing to a close, a general review was written 161 describ- 
ing the results and conclusions on the V M gene re- 
pertoire, on somatic hypermutation, the upstream 
regulatory elements and the various rearrangements 
and translocations of the Y, genes; also some biomedi- 
cal implications and evolutionary aspects were briefly 
dealt with. Since probably ally, genes of the locus are 
now known, it seems timely to compile their se- 
quences and to review the conclusions which can be 
reached from the comparisons. 

1 ) The structure of the human x locus 
Fig. 1 gives an overview of the x locus. It serves as an 
introduction to this review and, at the same time, pre- 
sents some results of sequence comparisons which 
will be discussed below. The locus is largely dupli- 
cated comprising a C x proximal contig (p) of 600 kb 
and a distal contig (d) of 440 kb, which are separated 
by 800 kb of as yet uncloned and probably^ gene-free 
DNA. The detailed restriction map of the locus com- 
prises 3Mb, while additional 4Mb were mapped at 
low resolution* 7,81 . 


The two large contigs p and d emerged from smaller 
ones that had been studied separately on numerous 
cosmid and phage A clones: Op/Od [9J , Ap/Ad ll0] , 
Lp/Ld IU421 , B [131 and VC-xde 1 14,151 . References to 
previous reports on the regions are cited in the quoted 
publications. The small contigs were fused to form 
the two large ones by chromosomal walking* 9 " 151 . 
Although we have good indirect evidence that the x 
locus has been fully or almost fully cloned* 6,121 we try 
to extend, with the help of YAC clones, the existing 
contigs at their 5' and 3' sides. 

2) Number and classification of the V x genes 

The p contig comprises the single C x , five J x and 40 V x 
gene segments, while in the d contig 36 V x gene seg- 
ments were found. There are 10 solitary V x genes and 
33 gene pairs whose sequences are 95-100% identical 
in the coding regions. 32 of the 76 V* genes are poten- 
tially functional, 16 have minor defects, 25 are 
pseudogenes and for three genes both potentially 
functional and slightly defective alleles have been 
found. 

The minor defects are defined as one or two one-bp 
alterations in a gene, for instance a stop codon in the 
coding part and/or a deviation from the canonical se- 
quences of regulatory elements, splice sites or the 
hepta- and nonanucleotide recombination sites. The 
replacement of a codon for an invariant amino acid 
(see below) by another codon is not considered to be 
such a defect, although for some V x genes with such 
alterations no mRNA or x proteins have been 


Abbreviations: 

V x , J x and C„: variable, joining and constant gene segments of the x locus, respectively; Op/Od, Ap/Ad, Lp/Ld and B regions: V x 
gene-containing regions of the C M proximal (p) or distal (d) copies of the human x locus, respectively (see Fig. 1); bp: base pair(s); 
kb: kilobase(s) or 1000 bp; Mb: megabase(s) or 10 6 bp; YAC: yeast artificial chromosome; PCR: polymerase chain reaction; pf: 
potentially functional; md: minor defects; ps: pseudo; dc: decanucleotide box; pd: 15-mer box. 
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found 1161 . The 16 genes with minor defects are defined 
as a separate class of genes, since also for them as for 
the three just mentioned ones potentially functional 
alleles may exist in the human population* 
All known V* genes and pseudogenes of the « locus on 
chromosome 2pll-12 are listed together with some 
pertinent data in Table 1. The sequences of all poten- 


Od 


Ad 


Ld 



42/U3S - 
28/724 3 


- cp 


Op 


Ap 


Lp 


B 


tially functional and slightly defective V x genes and 
also some pseudogene sequences have been pub- 
lished previously, and the data on the remaining 
pseudogenes are presented in another report^ 171 . Also 
included in Table 1 are the so-called orphon V H genes 
which lie outside the x locus on the same chromosome 
or on others. Up to now, 12orphons were localized on 
the long arm of chromosome 2, one on its short arm 
and 11 on other chromosomes. Most of the orphons 
turned out to be pseudogenes, not onV because of 
their location but also because of defects in their se- 
quences. Two orphon V„ genes (V108 and V268) have 
sequences that conform to the criteria of potential 
functionality; they are, however, listed as .pseudo- 
genes in Table 1 because of their location on the long 
arm of chromosome 2 (V108; ref. 1181 ) and on the short 
arm of this chromosome, but on the 3' side of C x at a 
distance of 1.5Mb (V268; ref. [19} ). There are probably 
further orphons scattered across the genome 1201 , but 
there is no compelling reason for us to study them 
exhaustively. 

The evidence that all V x genes of the locus have been 
cloned is indirect. It rests on three arguments. 

(i) No V x genes were found in the admittedly rela- 
tively short extending regions of the existing 
contigs' 9 ' 141 . 

(ii) Semiquantitative analyses of blot hybridizations 
of genomic DNA digests with gene probes in- 
dicate that the number of signals not assigned to 
the cloned V x genes at the time is small 121 J ; by now 
the genes corresponding to such signals have 
either been cloned or are attributed to orphon 
genes' 12 ' 20 '. 

(iii) In analyses of the\i mRNA/cDNAs and x pro- 
teins all gene products could be assigned to the 
known germ-line V x genes and no evidence for 
the existence of hitherto undetected germ-line V H 
genes was found* 1 ^. 

Since the three arguments are circumstantial we can 
only state that the number of V x genes and pseudo- 
genes in the locus very likely is 76. 


Fig. 1. Outline of the human x locus. 

Open boxes represent potentially functional genes, the filled 
boxes pseudogenes. Open boxes with crossed lines indicate 
genes with minor defects (as defined in the text) . The three 
genes for which potentially functional and slightly defective al- 
leles are known are marked by one diagonal line. The sub- 
groups are indicated by Roman numerals. The deletions (A) in 
the A and L regions have been described^ 10 ' 11 !. The drawing is 
not to scale. The number of nucleotides sequenced in both the 
p and d copy gene regions are given together with the number 
of differences between the regions. An asterisk at L8/L22 indi- 
cates that only the part of L22 which is highly homologous to 
L8 (12J is compared. 
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Table 1. The genes of the human x locus (011-B3) and the orphon V x genes (Wl-V268a) . 

Description 8 

Characteristics 15 

Divergence 6 
L V 

Described 
in 

Acces- 
sion 11 
number 

Oil 
01 

Olla (V3a) 

II f St, 
II, St, 

n,fi# 

pf 

0 

0 

3(2) 

[9] 
[9] 

X59314 
X59311 
X59317 

012 
02 

012a (V3b) 

I, St, 
I , st, 

I,fl, 

pf ,RP» mT ^ 

pf ( RTp 

md 

0 

KD 

0 

8(3) 

[93 
[9] 
[9] 

X59315 
X59312 
X59318 

013 
03 

II, St, 
II, St, 

ps 
ps 

- 

1 

[17] 
[17] 

X71888 
X71B92 

014 (Q5) 
04 

04/14a (DILpl) 

I'st, pf{ T 

I, St, pff T 
I,DIL,pf 

0 

0 
0 

[9] 
[9] 
[52] 

X59316 
X71893 

015 (Q8) 
05 

II, st, 
II, St, 

ps 
ps 


0 

[17 ] 
[17] 

X71889 
X71893 

016 (Ql) 
06 

06/16a (V55) 

I, st, 
I, St, 

I, st, 

ps 
ps 
ps 

0 
0 

1 

2 

[17] 
[17] 
[53] 

X71890 
X71894 
X00749 

.017 
07 

III, St, 
III, st, 

ps 
ps 

0 

0 

[17] 
[17] 

X71891 
X71895 

018 
08 

018 a 

I, St, 

i, st, 

I, St, 

pfl RTP 
pf 

0 

1(1) 

0 
0 

[25] 
[25] 
[25] 

M64856 
M64855 
M64857 

A15 
09 

A15a (HK100) 

i,st, 

I, St, 
I,fl, 

ps 
ps 
ps 

1 

0 

1 
0 

[17] 
[17] 
[1] 

X71883 
X71896 
V00560 


A16 
O10 

Al6a 

<HumXv301) 
A16/O10b 
(Humkv310) 


III, St, ps 

III, St, ps 

III,fl, ps 

III,fl, ps 


1 1 

0 


[17] 
[17] 
[54] 

[54] 


X71883 
X71896 
M17765 

M17764 


Z- ?. 
z \ 


Z I 


A17 
Al 


II, St, 
II, St, 

pf ,RTP 
pf,T 

0 

1(1) 

[10] 
[10] 

X63403 
X63402 


A18 
A2 


II, St, 
II, St, 

md,RT 
pf ,TP 

1 

6(5) 

[10] 
[55] 

X63396 
M31952 

! I 

A19 
A3 

(Q7) 

II, N, 
II, st, 

pf,Rp{ RTP 

0 

0 

[10] 
[56] 

X63397 
X12690 

^ 1 

A20 

A4 

A4a 

(Y2) 
(V52) 

I, N, 
I, St, 
I/St, 

md,RT 

md 

md 

2(1) 
2(1) 

2(1) 
2(1) 

[10] 
[56] 
[53] 

X63398 
X00748 


A21 
A5 


II, St, 
II, st, 

md 
ps 

0 

3 

[17] 
[56] 

X71884 
X12689 


A22 
A6 


III,St, 
III, St, 

ps 
ps 

1 

6 

[57] 
[17] 

X12685 
X71886 


A23 
A7 


II, st, 
II, st, 

pf ,RT 
Pf 

0 

4(3) 

[56] 
[10] 

X12684 
X63401 


A24 
A8 


II, st, 
II, st, 

ps 
ps 

0 

2 

[17] 
[17] 

X71885 
X71887 


A25 
A9 


i,st, 
i,st, 

ps 
ps 

1 

2 

[17] 
[17] 

X71885 
X71887 


A26 
A10 


VI, st, 
VI, st, 

md> i* 
mdf 

0 

0 

[10] 
[27] 

X63399 
X12683 
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Description* 

Characteristics'' 

Divergence 0 
L V 

uescriucu 
in 

sion d 
number 


A27 
A27a 

(HumJcv325) 
All 
Alia 

(HumXv3 05) 

III,St, pf,RTP 
III,fl, pf 

III, St, pf,R 
III, fl r Pf* 

0 

0 
0 

0 

5(4) 
5(4) 

[56] 
(58] 

[56] 

X12686 
M15038 

X12687 
M14507 

2 

A28 
A12 

II, St, ps 
II, st, ps 


3 

[56] 
[17) 

X12692 
X71882 


A2 9 
A13 

II, St, md 
IX, St, md 

1(1) 

8(4) 

[10] 
[10] 

X6340O 
X63395 


A14 

VI, St, md 



[27] 

X12688 


A3 0 

L14 (Q4) 

I,fl, pf,RTP 
I, St, pf,R 

0 

14 (10) 

[12] 
[12] 

X72808 
X63392 

Z 

LI (HK137; Q14) I,fl, pf,TP 
L15 (HK134; Q13) I,fl, pf 
L15a (HKlOl) I,fl, pf 

1 

1 

7(4) 
8(5) 

[2] 
[2] 
[1] 

J00248 
K01323 
V00558 

Z 

L2 III,Hah,p,RT) 
(Humkv328h5;Qll) [RTP 
L16 (QIO) III, St, p,T ) 
L16a III, Hah, pf 
(HumXv328h2) 

L16b III,Les,pf 
(Humkv328) 

L16c III,Les,md 
(Humkv329) 

0 
0 

0 

1 
1 

i 

X 

3(2) 

[59] 

[12] 
[59]- 

[59] 

[59] 

M23090 

X72815 
M23089 

M23088 

M23091 

1 

L3 (Q12) 
L17 

II,Hah,ps 
II, St, ps 

1 

3 

[12] 
[12] 

X72810 
X72811 


L4 (Va) 
LIB (Va") 
L4/18a (V4a) 

I, St, md i 
I, St, md,T> F 
I#fl. Pf 

0 
0 

1(D 

2 (2) 

[60] 
[61] 
[60] 

X00903 
X17262 
X00900 

1 

L5 (Vb) 
L19 (Vb") 
L5/19a (V4b) 

I, St, pf,T{ RT p 
I, St, pf * 
I,fl, Pf 

U 
1 

1 
0 

[60] 
[61] 
[60] 

V01577 
X17263 
V01576 


L6 (Vg) 
L2 0 (Vg") 
L6a (38K) 

III, St, pf ,RT 
III, St, pf,R 
III, pf 

2(1) 

4(3) 

ft 

u 

[62] 
[61] 
[63] 

X01668 
X17264 

>> 
6 

L7 (Vc) 
L21 (Vc") 

II, St, ps 
II, St, ps 

0 

8 

[60] 
(61] 

X00904 
X17265 


L8 (Vd) 
L22 

I , St , md , RT 
I, St, md 

0 


[60] 
[12] 

X00902 
X72816 


L23 (Q2) 
L23a 

I,GM607,pf 
I, N, pf 

0 

0 

[12] 
[12] 

X72817 
X72818 

/5k 

L9 (Ve) 
L2 4 (Ve"; Q3; 
V13) 

L24a 

I, St, md f R 
I, St, pf ,RT 

I,N , pf 

1(1) 
1(1) 

12(8) 
12(8) 

[60] 
[53] 

[12] 

X00901 
X00750 

X72819 

\ 
I 

LIO (Vh) 
LlOa 

L25 (V138; Q9) 

III, st, md 
III, N, md 
III, st, pf 

2(*) 

9(5) 

[62] 
[12] 
[12] 

X02725 
X72812 
X72820 

1 

Lll (Vf) 

I, St, pf,T 



[25] 

M64858 

» 

L12 (HK102,V1) 
Ll2a 

I,fl, pf 

I, ML, pf,RTP 


5(1) 

[1,53] 
[12] 

V00559 
X72813 

i 

L13 (Q6) 

II, N, md 



[12] 

X72814 
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Description' Characteristics b Divergence* Described Acces- 

L V in sion d 

number 


Bl VII, St, rod [13] X12682 


B2 (EV1S) 

V,P1, pf,RT 

[64] 

X02485 

1 

B3 

IV, St, pf,RTP 

[65] 

Z00023 

1 


Wl 

II, St, 

ps 

[66] 

X05101 

Via 

II, St, 

ps 

[67) 


W2 

II, St, 

ps 

[68] 

X51884 

W3 

I, St, 

DS 

[66] 

X05102 

W4 

II, st, 

ps 

[68] 

X51883 

W5 

III. St. 

DS 

[681 

X51882 

V6 

I, St, 

ps 

[66] 

X05103 

W7 

II, St, 

ps 

[68] 

X51881 

W8 

II, St, 

ps 

[68] 

v c i r> o f\ 

X5188 O 

W9 

I, St, 

ps 

[68] 

X51879 

W10 

II, st, 

ps 

[68] 

X51886 

Wll 

I, st, 

ps 

[68] 

X51885 

Chr22-1 

I, st, 

ps 

[69] 

Z00040 

Chr22-2 

III, St, 

ps 

[69] 

Z00042 

Chr22-3 

II, st, 

ps 

[69] 

Z00041 

Chr22-4 

II, st, 

ps 

[70] 

M20808 

Chr22-5 

I/St, 

ps 

[53] 

X00747 

Chr22-5a 

(V2b> I,fl, 

PS 

[53] 

X00746 

Chrl 

I,St, 

ps 

[69] 

M20809 

zi 

i,st, 

ps 

[71] 

M23653 

22 

I, St, 

ps 

[20J 

X64640 

23 

I,St, 

ps 

[20] 

X64641 

24 

i,st, 

ps 

[20] 

X64642 

Vllft 

i,st, 

ps 

[70] 

M20812 

V108 

i,st, 

ps(pf) 

[18] 

X51887 

V268 

III,St, 

ps(pf) 

[19] 

X74459 

V268a 

III, St, 

ps(pf) 

[19] 

X74460 


a In addition to the current designations previous and alternative ones are given in brackets. The Q/Y 
nomenclature was used in ref J 211 . Our "standard" alleles isolated from large contigs are named by 
straight numbers, e.g. L16; a, b, c is added to additional isolates, e.g. L16b. If an allele cannot be as- 
signed to the proximal or distal copy of the x locus, it receives a double designation, e.g. L5/19a. The 
CVproximal gene of a pair of duplicates is always listed first . . 

b This column gives several features of the V x genes: the subgroups (see text); sources of DNA; fl, fetal 
liver; St, N, different placenta DNA; GM607, DNA of a lymphoid cell line; the other source designa- 
tions are taken from the respective references, pf, ps and md refer to potentially functional genes, 
pseudogenes and genes with minor defects, respectively (see text). The orphon genes V108,V268 and 
V268a are classified as pseudogenes although they have no obvious defects in their sequence. They are 
therefore designated ps (pf). R.Tand P indicate that for the gene(s) a rearrangement product, a trans- 
cription product or a protein is known 1161 . The designation is given to a gene pair if the expression prod- 
uct could not be assigned to the p or d copy. For the special situation in the leader of L25 (asterisk) see 
Fig.4candref. Iul . 

c This column shows the numbers of nucleotides (amino acids in parentheses) in exons 1 and 2 which 
are divergent between duplicate genes. In the case of alleles the numbers are differences to the p copy 
genes. 

d Accession numbers listed are for EMBL and GenBank data libraries. 


3) % Z ene duplicates and the germ-line repertoire 
of V H genes 

The degree of divergence between duplicate^ genes 
is shown in Table 1 for exons 1 and 2 and in Fig. 1 for 


the total regions sequenced in both genes of a pair. 
The average extent of divergence is somewhat higher 
for pairs of pseudogenes than for pairs of potentially 
functional genes. In addition, the average divergence 
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seems to be lower in the .O regions than in the A and L 
regions. Both features may be interpreted in terms 
of a surveillance mechanism which counteracts the 
mutational divergence. On the average, the p and d 
copy-derived sequences of gene regions (Fig. 1) differ 
by about 1% (404 of 38136bp) indicating that the 
duplication is an evolutionarily, relatively recent 
event. 1% divergence is believed to correspond to 10 6 
years of evolution* 221 . But the differences in the extent 
of divergence of the. various duplicated gene regions 
will thwart all attempts to date the duplication event 
exactly. 

Because of the generally high similarity among the 
gene pairs the duplication of the x locus has not (yet?) 
increased much the repertoire of different germ-line 
V x genes. Since in most, although not in all cases both 
genes of a pair belong to the same category being 
either potentially functional, slightly defective or 
pseudogenes, the selective advantage of the duplica- 
tion in allowing the members of a pair to diverge from 
each other cannot be high. In general, the \^ genes of 
the d copy are expressed to a lesser extent than the p 
copy genes, but there are unequivocal examples of ex- 
pressed genes of the d copy* 7,16 * 231 . One feature of the 
duplicated structure, may become advantageous in 
cells that require a second V x - J K rearrangement in 
order to arrive at a functional joint: since most V x 
genes of the p copy are rearranged by a deletion 
mechanism the inversional rearrangement of the d 
copy V x genes may be useful in preserving the \£ gene 
repertoire* 7,231 . 

The existence of an apparently healthy individual 
who lacks the d copies of the x locus in a homozygous 
fashion demonstrates that the d copy genes are dis- 
pensable* 7,241 . The absence of one particular d copy 
gene (A2) in this individual is compensated by V x 
gene(s) of the p copy* 251 . This may be true also for 
other d copy genes since no increase of A chain- 
containing antibodies was found in the individual. In 
serological experiments, kindly performed by R. 
Linke, Munchen, the xlk chain ratio was determined 
to be about 60/40 in both the individual with only the 
p copies and individuals with p and d copies* 261 . 


4 ) Comparison of V x genes within subgroups and 

the derivation ofPCR primers 
The alignments of sequences of the subgroups I, II 
and III are shown in Fig. 2-4. Subgroups IV, VandVII 
consist of one germ-line V x gene each, and subgroup 
VI has three members, the sequences of which have 
been compared previously 110,271 . The sequence differ- 
ences among members of the V x gene subgroups and 
between the subgroups are compiled in Table 2. Mat- 


rices of gene-by-gene comparisons are included in 
ref ,* 261 . It can be seen in Tkble2 that the genes of the 
V X IV- V X VII genes constitute subgroups of their own, 
albeit single-member subgroups in three cases, and 
cannot be assigned to the predominant subgroups 
I-IIL It is also confirmed that theY, gene subgroups 
I -IV correspond closely to the classical subgroups 
I-IVof x proteins* 281 (see below). The differences be- 
tween the various V x gene subgroups are stressed by 
the comparison of some structural features of the 
gene regions in T^ble 3 . 

The alignments of the gene sequences of the major 
subgroups (Fig. 2-4) are an important source of infor- 
mation for all sorts of comparisons and classifica- 
tions. The alignments also present a rational basis for 
the selection of PCR primer pairs. Anumber of prim- 
ers has been suggested on the basis of the V x gene se- 
quences available at the time* 29 " 331 . The arrows in 
Fig. 2-4 and the comments in the respective legends 
should be taken as general suggestions. The specific 
primer sequences to be used depend on the particular 
experiment. On the 5' side the choice of primer de- 
pends on whether one wants to include the regulatory 
sequences, the leader region or only exon 2. On the 3' 
side one has the choice between the closer or more 
distant non-coding sequences. In rearranged V*-J x 
genes, J x consensus or J x -C x intron sequences are 
possible, while for cDNAs a part of the C x gene may 
be the sequence of choice. Amain determinant in the 
choice of primers in PCR with genomic DNA is the 
desired specificity or generality. If one aims at 
amplifying one specific^ gene one may have ro resort 
to using nested primer pairs (e.g. 1121 ). Primer pairs 
that are more or less specific for one of the V x gene sub- 
groups can be constructed. Primer pairs or mixtures 
of different primer pairs for amplifying V x genes of all 
subgroups about equally well are difficult to envisage. 
The search for the desired primer pairs is facilitated 
for the reader by the presentation of separate consen- 
sus sequences of the potentially functional V x genes 
with and without inclusion of the slightly defective 
genes. 

It was not feasible to include in the alignments the se- 
quences of the 25 pseudogenes of the x locus and of 
the 24 known orphon\£ genes. But depending on the 
choice of the primer pair one has to expect to amplify 
in genomic PCR also some of those gene sequences. 
Another caveat one has to keep in mind for PCR with 
multigene families as the V x genes is the possibility of 
PCR artifacts. Structures that probably arose as PCR 
artifacts were described recently for amplification 
reactions with cDN A mixtures 1161 , and they conceiva- 
bly also occur in PCR with genomic DNA when 
primer pairs of low stringency are employed. 
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Table 2. Subgroup classification of the V, genes of the human x locus'. 



I 

17,6,6 
% 

II 

9,5,13 
% 

III 

7,1,6 
% 

IV 

<B3) 
% 

V 
<B2) 

1 , , 
% 

VI 

(A10,A14 
A26) 

% 

VII 

<B1) 

/If 
% 

I 

L 
V 

87- 100 
<>67) 

88- 100 
(>71) 







TT 

L 
V 

78-92 
(>84) 
57-65 
(>64) 

92-100 
84-100 






III 

L 
V 

68-78 
72-78 

59-67 

66-71 
<>68) 

96-100 
(>74) 
92-100 





IV 

L 
V 

54-60 
70-75 

55-59 
68-71 ' 

55 
74-78 

100 
100 




V 

It 

V 

62-68 
61-66 

61-65 
57-61 

63—65 
61-65 

63 

100 
100 



VI 

L 

V 

64-79 
71-73 

61-70 
63-68 

61-65 
70-74 

57-60 
69-70 

55-65 
63 

77-100 
87 


VII 

It 
V 

60-64 
67-70 

57-61 
66-60 

53-57 
68-71 

43 
70 

59 
65 

55-65 
70-71 

100 
100 


B The numbers underneath the subgroup numerals are the numbers of pf , md and ps 
genes in the subgroup. L and V refer to exons 1 and 2. The numbers in the Table are 
percent similarity of potentially functional genes within and between subgroups. The 
values for slightly defective genes and pseudogenes are given in parentheses. 


Table 3. Features of promoter and intron regions of theV K gene subgroups. 


Subgroup 


dc Region" 


Length 
[bp] 


Intron 


Similarity b 

Length b 

[%] 

[bp] 

81-100 

124-126 

(68-100) 

(108-126) 

83-100 

370-440 

(82-100) 

(333-444) 

90-100 

170-190 

(74-100) 

(64-100) <^ 


219 


145 

(80) 

206-210 


.107 


Similarity^ 
[%] 


I 
II 

III 

IV 
V 
VI 
VII 


96-97 
108-126 

103 

103 
112 
113-115 
243 


69-100 
(65-100) 


80-100 
-H168-1446) 


(70) 


* The region from the first nucleotide of dc to exon 1 is compared. 
b Numbers in parentheses refer to pseudogenes. 

c Because of the high variability of length no similarities were calculated for the introns of the VJI-family. 


5) Conserved sequence elements 
One important outcome of the alignment exercise 
(Fig. 2-4) is the definition and evaluation of con- 
served sequence elements. A TATAA-like sequence 
is found 5' of all V x genes, but its exact sequence is 
rather variable. The decanucleotide (dc) sequence 
TNATTTGCAT was recognized early as a function- 
ally important promoter element* 341 ; independently, 
the octanucleotide ATTTGCATwas defined as a con- 
served sequence 1351 . The occurrence of the deca- resp. 
octanucleotide sequence in various gene systems was 
reviewed! 36 " 383 . In transcription assays the last seven 
nucleotides of dc appeared to be essential for promo- 
ter activity, while alterations in the first and third pos- 
ition of dc allowed reduced transcription^ 391 . Con- 
sequently, deviations from the heptanucleotide 
TTTGCATare considered a defect in aV x gene. The 
first three nucleotides of dc seem to be predominantly 
TGA, TGC or GGA (Fig. 2b, 3b, 4b). 

The 15-mer or pd element as defined in ref. [341 is 
found 17 bp 5' of dc in all V X I genes and approximately 
150 bp 5' of dc in A10 and A26, while it is not seen inV x 
genes of other subgroups (Fig. 2a, 3a, 4a). pd is not a 
promoter element essential for transcription* 401 but it 
may have a supportive activity* 411 . Another possibly 
supportive element is CCCT* 411 which is found in one 
or more copies 20-30 bp 3' of dc in almost all poten- 
tially functional V x genes (Fig.2b, 3b, 4b). An ACCC 
element, which was found to bind nuclear pro- 
tein(s) [421 , is less well conserved but is found in many 
potentially functional V x genes on the 5' or 3' side of 
dc, in some genes overlapping with the CCCT ele- 
ment. Several sequence motives that have been de- 
fined as potential binding sites for transcription fac- 
tors (review* 431 ) were found in the sequences by a com- 
puter-aided search* 441 ; they are shown in Fig. 2a and b, 
3b, 4b, but appear not to be highly conserved in V* 
genes. In summary, the heptanucleotide part of dc, 
the TATAA element and the C-rich sequences are 
highly conserved, pd and other sequences upstream 
or downstream of dc may help to increase transcrip- 
tional activity. 

Sequence regularities in the leader and in exon 2 will 
be discussed on the protein level (see below) . Some of 
the conserved regions in the intron (Fig. 2d, 3d, 4d) 
are related to functions in the splice process. A re- 
markable feature are..the .relatively narrow distribu- 
tion of intron sizes within the major V x gene subgroups 
and the pronounced differences in this respect be- 
tween the subgroups (Table 3). 

The recombination signal sequences are well con- 
served in the potentially functional V x genes of the 
major subgroups (Fig. 2f, 3f, 4f). The heptamer 


agrees fully and the nonamer to a large extent with 
the canonical sequences. This is so since by our for- 
mal definition a deviation from the canonical hep- 
tamer sequence is considered a defect that places an 
otherwise potentially functional gene in the category 
of genes with minor defects. This does not exclude 
that such genes do rearrange 1161 , which is in agree- 
ment with the finding that slightly altered recognition 
sites in a model system lead to a reduced recombina- 
tion frequency but do not abolish recombination* 451 . 
Considering mechanistic aspects of V x -J x rearrange- 
ments one should keep in mind that in all but very few 
potentially functional and slightly defective V x genes 
the last codon of exon 2 is a Pro codon; it is always the 
codon CCT, and even in the few non-Pro codons the 
last nucleotide is aT. The first nucleotides beyond this 
codon are, in most genes, C residues. It is an open 
question whether these nucleotides are recognized by 
the V x - J x recombination machinery. 


6) Subgroups of x proteins and the invariant residues 
in the sequences 

It has proven very useful to order immunoglobulins 
into sets of related sequences which in the case of the 
human x chains are called subgroups. Since the early 
studies of Milstein* 461 , Hilschmann* 471 and others the 
subgroup classification has become, mainly by the 
monumentous work of Kabat et al. (ref . [28J and earlier 
editions) a guiding principle of structural immunol- 
ogy. For subgroups I-IVthe classification was fully 
confirmed when the V x gene sequences became 
known. For five germ-line genes that did not fit into 
these subgroups the new subgroups V- VII were de- 
fined. Transcripts of V X V and VI genes have been 
found* 321 . Since no proteins are known that are de- 
rived from the YtV-VII genes it may be argued that 
the one-member families V and VII and the three- 
member family VI are not subgroups in the sense of 
the classical definition 1281 . But a rigid distinction be- 
tween sequence families of genes and subgroups of 
proteins becomes cumbersome, for instance when\k 
gene and protein sequences are compared across 
species boundaries. An example are the correlations 
between certain groups of human and murine\* genes 
(exon 2* 48] ; introns* 261 ) which are interesting for the 
consideration of the evolution of the x locus . It, there- 
fore, appears to be justified to use the subgroup class- 
ification also for sequences that are known only at the 
nucleic acid level. 

The formal translation products of the germ-line 
V^I-III genes were aligned and are presented in 
Fig. 5a-c. Sequences derived from potentially func- 
tional and slightly defective genes are grouped sepa- 
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rately. It is not surprising that most residues defined 
as invariant (i.e. 95% and more conserved within a 
subgroup) in the collection of protein, cDNA and 
gene sequence* 281 are also found here to be conserved. 
The relatively high homogeneity of sequences within 
the subgroups is interesting because of the just men- 
tioned relation between sets of human and mouse 
genes. Apparently, some subgroup-specific features 
of V* genes have been conserved in evolution for a 
long time and divergence is counteracted in the 
species by surveillance mechanisms. 

The presentation of the residues conserved among all 
potentially functional YJ-IV genes (Fig. 5d) em- 
phasizes the residues that are particularly important 
for the structure and/or function of the x chains. Such 
highly conserved residues are seen all along the se- 
quences and in a cluster at the beginning of FR3. 
) They have to be considered with particular attention 
in all attempts of antibody gene engineering. The 
cluster of conserved residues in FR3 may play a role in 
the intracellular transport and secretion of the pro- 
tein, as indicated by experiments with mouse A 
chains* 491 . Its importance is also seen from the fact 
that it is not only conserved between the x subgroups 
and between x and A chains but also in several mam- 
malian species. The same is true for the four residues 
shown with double underlining in Fig. 5d. Their con- 
servation is illustrated by the numbers of light chains 
in which they do or do not occur: Cys 23, 1450/3; Trp 
35, 1242/3; Gly 57, 1133/12; Cys 88, 1134/1. The num- 
bers are derived from the sequence collection of 
Kabatetal. [281 

The leader sequence that functions in intracellular 
transport and/or secretion of the proteins is highly 
} conserved within the subgroups, while between sub- 
groups only some hydrophobic residues in the 3' part 
of the leader sequence are kept invariant. 


7) Some aspects of the evolution of the human x locus 
To discuss evolution is one of the attractions of study- 
ing a multigene familiy like the V x genes. Such discus- 
sions play a role in many reports from other 
laboratories (e.g.* 2,28,331 ) and from our laboratory. 
Some of our earlier findings were discussed in the con- 
text of evolution in a review article* 50 * , for instance the 
duplication of the locus, the interdigitation of genes 
of different subgroups and the relationship between 
orphons and V x genes of the locus. While the evolution 
of most pseudogenes and orphons is dwelled on in 
another report* 171 we like to enumerate here some of 
the reactions or mechanisms that must have operated 
in shaping the present-day x locus; we will then turn 


to the discussion of sequence similarities between V x 
genes and groups of such genes. 

After duplication, interdigitation, conservation of in- 
variant residues and generation of orphons the point 
mutations should be mentioned next. The average di- 
vergence of about 1% between the p and d copies 
which has been mentioned in section 3 is largely due 
to point mutations. Not surprisingly, there are about 
twice as many transitions than transversions, and no 
indication for a pronounced selection was detected 1261 . 
Stop codons have been created in the locus 18 times, 
some of them in genes which are pseudogenes also on 
other accounts. The insertion of a single G residue in 
exon 2 of the gene A4 led to a debilitating frame 
shift* 101 . Clear examples of gene conversion were ob- 
served between some genes of the L region* 2,121 and in- 
dications of information transfer are also seen in 
other regions of the locus. Deletions of gene-contain- 
ing stretches of DNA could well be defined in cases 
where they occurred only in one of the copies.of the 
locus* 101 , or in both copies but with different break- 
points of homology* 111 . Insertions of repetitive ele- 
ments occurred both before and, as in the case of one 
particular Alu element* 101 , after the duplication. 

The similarities between the V x genes of subgroup I 
are particularly high in the L regions, in part certainly 
because of the just mentioned gene conversions. The 
homologies extend far into the downstream region to 
a particular structural element, the so-called L ele- 
ment* 51] , which is found only in this group of VJi genes 
and in the orphon VJ geneV108* 181 . The V X I genes of 
the O regions seem to form, together with some or- 
phon V X I genes, a group of their own. The Y, genes of 
subgroup II are less similar to each other than the 
ones of other subgroups. Also here the L region and 
the O region plus some orphon V*II genes form groups 
of their own. TheYJII genes again constitute a some- 
what more homogeneous group with the possibility to 
define more closely related subsets in the L, A, and O 
regions. The relationships between the genes of the 
three main subgroups are discussed in some detail in 
ref J 261 . Relationship diagrams, which can be drawn in 
all cases, have to be taken with a grain or even a kilog- 
ram of salt if it comes to equating them with evolutio- 
nary trees, not the least because of gene conversion 
and possible other surveillance mechanisms. The 
same is true for the ur-V x gene and the ur-V x I-IV(or 
I- VII) Y, genes which one somehow has to postulate. 
The comparison of human and mouse V H gene se- 
quences indicates that, at least in some cases, sub- 
group formation has predated speciation* 26,481 . The 
evolution of the human x locus will become better un- 
derstandable once the x loci of primate(s) and other 
mammals are known. 
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CniX^TGl-GCT^CTKCCUTacCTGCTGAt 1 
c r 3CAGCTG|"GCTCAG xtgccccatgccctgctgat 

TGTGATGCCCTGGTCCGACCTGGGACAC . TGGGGA . CATTGCTCCTATGCTGAGTTACTGAGAAGAGCCAGCCC T 3CAGCTGfGCCCAG XTGCCCTATCCCCTGC 

C T jCAGCTGiGCCCAG XTGCCCTATCCCCTGC 


TGTGCTACCgTGATCTTACCTGGGACACCTGGOGA . CACTGAGCTGCTGCTGAGTTACTGAGATGAGCCAGCC£ rtsCAGCT 
TGTGCTACCCTGATCTTACCTGGGACACCTGGGGA . CACTGAGCTGCTGCTGAGTTACTGAGATGAGCCAGCCC TdCAGCT 


TGTGCTA^ggTGGTCTTACCTGGGACACCTGGGGA . CAYTGAACTGGTGCTGAGTTACTGAGATGAGCCAGCCC 
TGTGCTA£CgTGGTCTTACCTGGGACACCTGGGGA . CACTGAGCTGGTGCTGAGTTACTGAGATGAGCCAGCCdTpCAGCl 



TG SCCCAG XTGCCCCATCCCCTGC 
T6 rGCCCAG XTGCCCCATCCCCTGC 
iCAGCTG TGCCCAG XTGCCCCATCCCCTGC 
TG TGCCCAG XTGCCCCATCCCCTGC 
TG TGCCCAG XTGCCCCATCCTCTGC 


TG :GCCCAG XTGCCCCATCCCCTGC 


iCAGCTG IGCCCAG XTGCCCCATCCCCTGC 


TGTGCTAggTGGTCTTACCTGGGACACCTGGGGA . CACTGAACTGGTGCTGAGTTACTGAGATGAGCCAGCCQTj 
TGTGCTACCCTGGTCTTTCCTGGGACACCTGGGGA. 


CipCAGC 
CrpCAGC' 


TG TGCCCAG 


GTGCAGGTCGACTCTAGAGG WTCC^ CTG TGCCCAG XTGCCTTGCCCCCTGC 


TGTGCTACCgTGGTCTTACCTGGGACACCTGGGGA . 
TGTGcTaCcctggTCttaCCTGGGACACcTGGGGA. CAcTGagCtggTGCTGAGT- 


. CACTGAGCTGGTGCTGAGTTACTGAGATGAGCCAGCTC T XAGCTG TGCCCAG FCAGCCCCATCCCCTGC 

C r 3CAGCTG TGCCCAG XTGCCCCTTCCCCTGC 
CACTGAGCTGGTGCTGAGTTACTGAGATGAGCCAGCTC T 3CAGCTG rGCCCAG XTGCCCCATCCCCTGC 
tactGagatgagCcAGccc r sCagCTG rGCcCAG XtGCCccatcCcCTGC 


iCTGCCCCATCCTCTGC 


TGCTGAGTTACTAAGAT6AGCCAGCCC T 3CAGCTG TGCTCAG XTGCCCCATGCCCTGC 
. CAGAGATGAGCCAGCCQCECAGCTG TGCCCAG XTGCCCCAACCCCTGC 
. CAGAGATGAGCCAGCCQC 3CAGCTG TGCCCAG XTGCCCCAACCCCTGC 


012 
02 
018 
08 
A30 
LW 
LI 
LIS 
LU/18A 
LS 
L19 
L5/19a 
L23 
L24 
LU 
LL2 

PF-CONS. 

012a 
A20 

m 
m 

U8 
L8 
L22 
L9 

CONS. 

Fig. 2a . 

Fig. 2. Alignment of the gene regions of the potentially functional (pf) and slightly defective (md) V x genes of subgroup I. 
For the boxed and underlined elements see text. Alleles of genes are shown only if they differ from their respective standard gene. 
The upper blocks of sequences show all pf genes and the lower one the genes with minor defects. Consensus sequences are given 
for the pf genes (pf-cons.) and for pf plus md genes together (cons.). Lower case letters are used for nucleotides occuring in more 
than 50% of the positions. A dash indicates that no nucleotide occurs in more than 50% of the sequences at the position. Dotted 
lines indicate gaps found in the sequences. Gene sequences having no obvious defects at the nucleotide level, but showing ex- 
changes of invariant amino acids are placed with the md genes. They are marked by an asterisk. 

a) Approximately lOObp upstream of the dc box are shown. The pd box would be suitable as part of a largely V x I-specific primer. 

b) The 100 bp upstream of the leader sequence are shown. For the boxed and underlined elements see text. L9 is considered to be 
a gene with "minor" defects because of a deletion in the dc box and an altered first ATG in exon 1. The arrow designates one of the 
possible VJ-specific primers. 

c) Exon 1. The 5' part of the arrow may yield a largely V X I- plus VJI-specific primer while the 3' part seems to be more or less V X I- 
plusVJII-specific. 

d) Intron sequences. The possible primer may have a limited subgroup specificity. 

e) Exon 2. In the framework regions several primers are conceivable. 

0 Where known, the first 100 bp downstream of exon 2 are shown . The hepta- and nonanucleotide recognition sites are underlined. 
It may be difficult, because of these sites, to construct subgroup-specific primers. 


TATGCTGCACTGGTCTGACCTGGGAC . CCT GGGGA . CATTGCCCTG . 
TGTGCTAfXCTCCTCTGACTTGGGACAC . TGGGGA. CACGAGGGACTGGCT 
TGTGCTACCCTCCTCTGACTTGGGACAC . TGGGGA . CACGAGGGACTGGC T 

CT. .CTGCA GACTnGGACACCTGGGAGGCACTGGGCCTGTGCAGTGTT^ . 

CT. .CTGCA GACTTTGGACACCTGGGAGGCACTGGGCCTGTGCAGTGTTArrGAGATAAGTCATCTT T 3CAGCTG . 

TGTGCT ACCCT GGTCTTAC C TGGGACACC TGGGGA . CAYTGAACTGGTGCTGAGTTACTGAGATGAGCCAGCCC [ 3CAGCTG TGCCCAgLcTGC 

C f 3CAGCTG TGTCCAG XTGC 


. CACTGCCCCTGTGCTGAGTTACTGAGATAAGCCAGCCA : 3CAGCTG TATCCAG XTGCCCCACCCCCTGC 
. CACTGCCCCTGTGCTGAGTTAC TGAGATGAGCCAGCCA Z 3CAGCTG TATCCAG XTGCCCCACCCCCTGC 


TAGG . TGCCCTGGTCTGACCTGGGACAC . TGGGGA . C 
TAGG. TGCCCTGGTCTGACCTGGGACAC . TGGGGA . C 

TGTGCTACCCTGGTCTTACCTGGGACACCTGGGGA . CAYTGAACTGGTGCTGAGTTACTGAGATGAGCCAGCCC [ SCAGCTG TGCCCAG XTGC . CCATCCTCTGC 
tgtgcTaccctggtcttaCcTgGGACaCcTGGGga.O^tgagctggtGCtcsagttactgagatgagccAgccc r sCagCTG tgcccag xtgccccatccccTGC 


00 


.... TGC 
....TGC 
.CCATCCTCTGC 
.CCATCCTCTGC 
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08 
A30 
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U 
LL5 
LM/18A 
L5 
U9 
L5/19A 
123 
L2*l 
L2^a 
Lll 
L12 

PF-CONS. 

012a 
014* 

A20 
A<* 
Lft 

LL8 
L8 

122 
19 

CONS* 


rCATTTGCAT 3 rTCCCAG \GCACMCCTCC. . TGCCCTG. AAGCC rTATTAA TA 3GCTGGT ZAGACTTTC FGCAGGAAT CAGACCCAGTCAGGACACAGC 
rCATTTGCAT I rTCCCAG VGCACAACCTCC. . TGCCCTG. AAGCC rTATTAA TA 3GCTGGT ^Cm qrGCAG<W 
rCATTTGCATp ITCCCAG \ACACAACCTCC. . TGCCCTG. AAGAC TTCTTAA TA 3GCTGGT !ACACTTCTTGCAGGAGTCAGACCCACTCAGGACACAGC 


fCATTTGCAT 3 rTCCCAG \GCACAGTCTCC . . TGACCTG . AAGAC ITATTAACA jGCTGATCACACCCTGTGCAGGAGTfrUSACCCAGTCAGGACACAGC 

I -* _ -i _ 1 

rCATTTGCAl 3 TTCCCAC^GCACAGTCTCC . . TGACCTG . AAGAC ITATTAACA 3GCTGATCACACCCTGTGO^ 
TCATTTGCAT ft TTCCCAG ttCACAACCTCC. .TG.CCTG. AAGAC rTCTTA/jfA 3GCTGGT£ ACACTTCnGCAGGAGTCAGAtt^ 


FGATTTGCAT 3|TTCGCAG AGCACA qCCCCC. .TGCCjCT G. AAGAC rTATTAA [A ^mpXA^G^fi^AGJO^KC(MCama<a^ 
rCATTTGCAT 5 ITCCCAGftGCACAGCCTCC. . TGCCCTG . AAGCC TTATTAAjTA ^GtWCACACTTCA^^^g^GTCCCACTCAGGACACAGC 
rCATTTGCAT 3 rTCCCAG U3CATAGCCTCC . . TGCCCTG. AAGCC TTATTAATA 3GCTGGACACACTTCATGGAGGAATCAGTCCCACTCAGGACACAGC 


TGATTTGCA 


HAPF 

L 


AP2 TATA SP1 NFIL6 

rCATTTGCAT sjrTC . CAGAGCACAC ^CCCC , .TGCCC TG I AAGAC rTTTTTA ?G 3GCTGGT :GCAgCTGTGCAGGAGTCAGTCTCAGTOUXaACACAGC < 
TGATTTGCA1 3TTC , CAGAGCACA GCCTCC. .TGCCCTG . AAGAC rTTTTTA TG 3GCTGGT :GCAg^GTGCAGGAGTCAGTCTCAGTCAGGACACAGC 
rGATTTGCATpTTCGCAGAGCACAG CC CCC . . TG CQCTG ■ AAGAC rTATTAA TA 3GCTGGT -GCACCCTGTGCAGGAGT CAGT CCCAACCAGGACACAGC 

I — — — J ] mdSSS | 


»T 3TT< 
I 


TAATTTGCAT 3 rTCCCAG HGCACATCCTCC . . TACCCTG .AAGAC ITATTAATC CGCTGGTCACACTTCATGCAGGAGTCAGACCCAGTCAGGACACAGC 
TCATTTGCA7 3 rTCCCAG HGCACAACCTCC. . TGCACTG. AAGCC TTATTAATA 3GCrGGCWCACTTWTGCAGGAGTCAGACCCAGTCAGGACACAGC 
rCATTTGCAT jjTTCCCAG AGCACAACCTCC .TGCACTGCAAGCC TTATTAATA 3GCTGGCCACACTTCATGCAGGAGTCAGACCCAGTCAGGACACAGC 


fCATTTGCAT 3 ITCCCAG U3CACAGTCTCC. . TGACCTG. AAGAC rTATTAACA 3GCTGATCACACCCTGTGCAGGAGTCAGACCCACTCAGGACACAGC 


100 


rCATTTGCAT G rTCCCAGft GCACAACCTCC. . TGCCCTG. AAGCC (TATTAA fr/^CTGGTP OCTnGTGCAGGAGTCAGACTCAGTCAGGACACA^ 
TcATTTGCAtIsTTCcCAGAgCAcAgcCtCC. . ToccCTG. AAGaC ITaTTaAta sGCTGgtCacACtttgTGcAGGAgTCAGaCcCAgtCAGGACACAGC. 


C. CAGAGCACAGCC CCC. . T G CCCTG . AAGAC 


ITTTTTA 


rGATTTGCAT 5TTCCCAGAGCACAGCC CCC . . TGCCCTG . AAGTC ITCTTAACA jGCTGGTCACACCCCGTGCAGCAGTCAGTCCCAGTCAGGACACAGC 
TGATTTGCAT 3TTCCCAGAGCACAGCCCCC . . TGCCCTG. AAGTC rTCTTAACA jGCTGGTCACACCCCGTGCAGCAGTCAGTCCCAGTCAGGACACAGC 


rCATTTGCAT rTCCCAG AACACAACCTCC . . TGCCCTG. AAGAC ITCTTAA 


rCATTTGCAT ITCCCAG AACACAACCTCC . . TGCCCTG . AAGAC 


.TGCCCTA. AAGAC 


1 


:gcactctgtgcaggagtcagtctcagtcaggacacagc 


FGATTTGCAT 3TCC . CACAGAGCAACGCCTACTGCCCTG . AACAT ITATCAA rA 3GCTGGT 3ACATCCTGTGO«3AAGTCTCTCTCAOTCAGGACACAK 
fU3ATTTGCA7pTCC . CACAGAGCAACGCCTACTGCCCTG . AACATnATCAA rA 3GCTGGT 3ACATCCTGTGCAGAAGTCTCTCTCAGTCCAGACACAGC 

TA 3GCTGG7 ZACACnCTTGDVGGAGTCAGACCCACTCAGGACAWGC 


^TnGCTTtrrCCCAqftGCACCACCCCC . . TGCCCT A.MGACrrcnAAiTACGCTGGTtACACCT . GTGCAGGAGTCAGTCCCAGTCAGGACACAGC 
XTnTGCTTHrTCCCA»GCACCACCC££ . 


ITCTTAAfA 3GCTGGT] CACACCT . GTGCAGGAGTCAGTCCCAGTCAGGACACAGC 

fTCA ACCTCC . . TGCCCTG. AAGAC rTATTAATA SGCTGGACACACnCATGCAGGAGTCA^CCCTGTO^ 

f catttgca'HgttcccagagcacaacctCc * . TgccCTg.AAgac TTaTtaAta 3GCTGgtcacActt^TGcAGgAgTCagaCcCagtCagGAQ\D\GC 


ITCTTAA rA 3GCTGGT !ACACTTCTTGCAGGAGTCAGACCCACTCAGGACACAGC 


Fig. 2b 
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018a 
08 

A30 

LW 

& 
U5a 
L5 

L3/19A 
L19 
L23 
L2<4 
Lll 
L12 

PF-CONS. 


55 


ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTACTCTGGCTCCGAG 
ATGGAWT6AGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTACTCTGGCTCCGAG 
ATGGACATGAGGGTCCCTGCTCAGCTCCTGGGGCTCCTGCTGCTCTGGCTCTCAG 
ATGGACATGAGGGTCCCTGCTCAGCTCCTGGGGCTCCTGCAGCTCTGGCTCTCAG 


ATGGACATGAGGGTCCCTGCTCAGC 


CCTGGGGCTCCTGCTGCTCTGGCTCTCAG 


ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTGCTCTGGTTCCCAG 
ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGKTCCTKTOTCTOTTCCC^ 
ATGGACATC^GAGTCCTCGCTCAGCTCCT^CTCCTGCTGCTCTGn 
ArGGACATGAGGGTCCTCKTCAGCTCCTGGGGCTCCTKTGCTCTGTn 
ATGGACATGAGGGTCCTCGCTCAGCTCCTGGGGCTCCTO^ 
ATGGACATGAGGGTCCCCGCTCAOTCCTGGGGCTCCTGCTKT 
ATGGACATGATGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTGCTCTGGTTCCCAG 
ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGKTCCTKTKTCT^nCCCAG 
ATGGACATGaGGGTGCCCGCTOUKGCCTGGGGCTCCTO 
ATGGACATGAGGGTCCCCGCTWGCTCCTGGGGCTCCTOTGCTCTKCTCC 
ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTK^ 
ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGTCTCCTK^ 
ATGGACATGAggGTcCcc GCTCAGCtCCTGGGGCTCCTGCtgCTCTGgtTCcc AG 

012a atggacatgagggtccccgctcagctot 
om* atggacatgagggtccccgctcagctcctggggctcctactktctgktccw^ 

W ATGGACATGAGGGTCCCCGCTCAGCTCCTGG(^CTCCTACTGCK 

A20 ATGGACATGAGGGTCCCTGCTCAGCTCCTGGGACTCCTGCTrcTCTGKTCCCAG 

M ATGGACATGAGGGCCCCCGCTCAGCTCCTGGGACKCTKTGCTCTG^TCCW 

m ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTTCTOT^TCTGKTCCC^ 

L18 ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCnCTGCTgTCTGKTCCCAG 

L8 ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTGCTCTG^TCCCAG 

L22 ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTGCTCTGGCTCCCAG 

L9 ATAGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTGCTCT^TCCCAG 

cons . ATGGACATGAggGtcCccGCTCAGCtCCTGGGgCTcCTgCtgCTCTGgcTCccAG 


Fig. 2c 


012 gtaaggatggagmcactaggaatttactwc.cagtgto:tcagtactc^ctgg^ 

02 gtaaggatggagaacactaggaatttactcagc . cagtgtgctcagtactgactggaacttcagggaagttctctgataacatgattaatagtaagaa.t 

018 gtaaggaaggataacactaggaattttctcagc . cagtgggctcagtacagcctggctcttgatggaagccttcctataatatgactaatagtatgaa. t 

018a gtaaggaaggataacactaggaattttctcagc . (^gtgtgctcagtacagcctggctcttgatggaagccttcctataatatgactaatagtatgaa, t 

08 gtaaggaaggataacactatgaattttctcagc . cagtgggctcagtacagcctggctcttgatggaagccttcctataatatgactaatagtatgaa, t 

A30 gtaaggatggagmcactagcagtttactcagcccagagt^ c 

im GTAAGGATGGAGAACACTAACAGTTOCTCAGCCCAGAGTKTCAGTACTGCm C 

LI GTAAGGATGGAGMC^CTAGCAGTTTACTCAGCCCAGGGTGCTCAGTACTGCTTC . C 

L15 GTAAGGATGGAGMCACTAGCAGTnACTCAGCCCAGGGTGCTCAGTACTGCTTTACTAnCAC^^ . C 

m/lBA GTAA3GAAGGAGAACACTAGCANTTTACTCAGCCCAGGGGGCTCAGTA 

15 GTAAGAAAGGAGAACACTAGGATTATACTCGGT . CAGTGTGCTGAGTACTGCTTTACTATTCAGGGAACTTCTCTTACAGCATGATTAATTGTGTGGA.C 

L19 GTAAGAAAGGAGAACACTAGGATTATACTCGGT . CAGTGTGCTGAGTACTGCTTTACTATTCAGGGAACTTCTCTTACAGCATGATTAATTGTGTGGA . C 

L5/19a GTAAGA7VAGGAGMCACTAGGAATTTACTCAGCCCAGTGTGCTGAGTACTGCTTTACTATTCAGG . C 

L23 GTAAGGAAGGAGMCCCTAGCAGTTTACTWGCCCAGTGTGTTC^^ C 

L2i* GTAAGGAAGGAGAACACTATGAATTTACTCAGC . CMTGTGCTCAGTACAGCCTGGCCCTTCAGGGAMTTCTCTTACTACATGATTAATTGTATGGA. T 

Lll GTAAGGATGGAGAACACTGGCAGTT7ACTCAGCCCAGGGTGGTCAGCACAGCCTGGCTATTCAG^ 

L12 GTAAGGAAGGAGAACACTAGGAATTTACTQU3CCCAGTGTGCTCAGTACTGCCTGGTTA . T 

L12A TGTGAA.T 

pf-cons . GTAAGgMGGAgMUCTagc^-TtTaCTCaGccCAgtGtGcTcaGtACtGcc^ . c 

OM* GTAAGGAAGGAGAACACTAGGAATTTACTCAGC . TGGTGCGCTGAGTATAGCCTGGCTCTTCAGGGAGGtCTTCTTATAACATGATGGAiTGAATGGA . t 

CW • GTAAGGAAGGAGAACACTAGGAATTTACTCAGC . TGGTGCGCTCAGTATAGCCTGGCTCnCAGGGAGGTCTTCTTATAACATGATGGATTGAATGGA . T 

A20 GTAAGGAGGGAAACAACAAAAATTTTATTCAGC . CAGTGTAGCCACTMTGCCTGGCACTTCAGGAAATTCTTCTTAGAACATTACTAATCATGTGGA . T 

M GTAAGGAGGGAAACAACAAAAATTTTATTCAGC . CAGTGTAGCCATTAATGCCTGGCACTTCAGGAAATTCTTCTTAGAACATTACTAATCATGTGGA . T 

LM GTAAGGAAGGAGAACACTAGC^TTTACTCAGCCCAGGGGGCTCAG 

L18 GTAAGGAAGGAGAACACTaA(^TTTACTCAGCCCAGGGGGCTCAGTACAGCCTGK 

L8 GTAAGGAAGGAGMCACTAGGAATTTACTCAGCCCAGTGTGnCCGTACAGCCTGGCTCnGAGGGAAGTTCTCn . C 

L22 GTAAGGAAGGAGAACACTAGGAATTTTCTTAGCCCACTGTGCT CTGGCACTTCTGGGAAGTTCTCTTATACCATGATTCATGGTGTGGA . T 

L9 GTMGGAAGGAGAACACTAGGMTTTACT(^CCCAGTGTGCTTGGTAOVGCCTGGCCCTTCAGGGAAGnCTCnACM . C 

cons . GTAAGgAaGGAgAacaCtaggA-TtTac TcaGcccagtGtgctcagtAcagccTggctcTTcagGgAa-ttcTCttAcaacATgAttaATtgtgtGgA . c 


100 


012 
02 
018 
08 
018a 
A30 
Ll<4 
U 
U5 
W/18a 
L5 
L19 
L5/19A 
L23 
L24 
Lll 
L12 
L12A 

PF-CONS , 


ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTGTTTATGTTTCTAATCGCAG 
ATTTGTGTTTATGTTTCTAATCGCAG 
ATTTGTGTTTATGTTTCTAATCGCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTGTTTATGCTTCCAATCTCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTGTTTATATTTCCAATCTCAG 
ATTTGTTTTTATGTTTCCAATCTCAG 
ATTTGTTTTTGTGTTTCCAATCTCAG 
ATTTGTGTTTCTATTTCCAATCTCAG 
ATTTGTGTTTCTATTTCCAATCTCAG; 
ATTTGTtTTTaTgtTTCcAATCtCAG 


126 


CONSERV 


oia* GTTTGTTTTTATATTTCCAATCTCAG 
W GTTTGTTTTTATATTTCCAATCTCAG 
A20 ATGTGTTTTTATGTTCCTAATATCAG 
A^ ATGTGTTTTTATGTTCCTAATATCAG 
m ATT TGTGT TT ATGCTTCCAATC TCAG 
L18 ATTTGTGTTTATGCTTCCAATCTCAG 
L8 ATTTGTGTTTATATTTCCAATCTCAG 
L22 ATTTGTTTTTATGTTTCCAATCTCAG 
L9 ATTTGTTTTTATGTTTCCAATCTCAG 
aTtTGTtTTTaTgtTTCcAATctCAG 


Fig. 2d 
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L' FR1 CDR1 

1 1 ! 

f . I . . . . I . . 100 

012 GTGCCAGATGTGACATCCAGATGACC(^TCTCCATCCTCCCTGTCTGCATCTGTAGGAGACAGAGTCACCA^ 

02 GTGCCA&VTGTGACATCCAGATGACCCAGTCTC 

018 GTGCCAGATGTGACATCCAGATGACCCAGTCTCCATCCTCCCTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGCCAGGCGAGTCAGGA 

08 GTGCCAGATGTGACATCCAGATGACCCAGTCTCCATCCTCCCTGTCTGCATCTGTAGGAGACAGAGTCACCATCA^ 

A30 GTGCCAGGTGTGACATCCAGATGACCCAGTCTCCATCCTCCCTCT 

LIU GTGCCAGATGTAACATCCAGATGACCCAGTCTCCATCTGCCATGTCTGCATn^ 

LI GTGCCAGATGTGACATCCAGATGACCCAGTCTCCATCCTCACTGTCTGCATCTCT 

L15 GTGCCAGATGTGACATCCAGATGACCCAGTCTCCATCCTCACTGTCTGCATCTGTAGGAGAC^GAGTCACCAT^Cn 

L15a GTGCCAGATGTGACATCCAGATGACCCAGTCTCCATCCTCACTGTCTGWTCTGTAGGAGACAGAGTCACCATC^ 

LU/IBa GTGCCAGATGTGCCATCCAGTTGACCCAGTCTCCATCCTCCCTGTCTGCATCTGTAGGAGACAGAGTCACWTCACTTGCCG 

L5 GTT CCAGATGC6ACATCCAGATGACCCAGTC TC CATCTTCCGTGT CTGCATC TGTAGGAGACAGAGT CACCATCAC TTGT CGGGCGAGT CAGGGT ATT AG 

U9 GTTCCAGATGCGAWTCCAGATGACCCAGTCTCCATCTTCTGTGTCTGCATCTGTAGG 

L23 GTGCCAGATGTGCCATCCGGATGACCCAGTCTCCAnCTCCCTGTCTGCATCTGTAGGAGACAGAGTCACCATCAC 

L2<1 GTGCCAGATGTGTCATCTGGATGACCCAGTCTCCATCCTTACTCTCTGCATCTACAGGAGACAGAGTCACCATCAGTTGTCGGATG^ 

Lll GTGCCAGATGTGCCATCCAGATGACCCAGTCTCCATCCTCCCTGTC^ 

L12 GTGCCAMTGTCy\CATCCAGATGACCCAGTCTCCTTCCACCCTGTCTGCATCTGTAGGACWCA&^ 

L12a GTGCCAAATGTGACATCCAGATGACCCAGTCTCCnCCACCCTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGCCGGGCC^ 

pf-cons. GTgCCAgaTGtgaCATCcaGaTGACCCAGTCTCCaTcctcccTgTCTGCATC 

012a gtgccagatgtgacatccagatgacccagtctccatccttcctgtctgcatctgtaggagacagagtcac^ 
014* gtgccagatgtgacatccagttgacccagtctcwtcctccctgtctgcatctgta^ 

gtgccagatgtgacatccagttgacccagtctccatcctccctgtctgcatctgtaggagacaga^ 
A20 ataccagatgtgacatccagatgacccagtctccatcctccctgtctgcatctgtacgagacagagtc^c 

to ATACCAGATGTGACATCCAGATGACCCAGTCTCCATCaCC^ 

W GTGCCAGATGTGCCATCCAGTTGACCCAGTCTCCATCCTCCCTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGCCGGGCAAGTCAGG 

L18 GTGCCAGATGTGCCATCCAGTTGACCCAGTCTCCATCCTCCCTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGCCGGGCAA 

LB GTGC CAGATG TGACATCCAGTTGACC CAGTC TCCA TCCTTCC TGTCTGCATCTGTAGGAGACAGAGTCACCATCAC T TGC CGGGCCAGTCAGGGCATTAG 

L22 GTGTCAGAnTC^CATCCAGATGATCC^TCTCCATCTTTCCTGTCTGCATCTGTAGGAGACAGAGTCAGTATCATTTGC 

L9 GTGCCAGATGTGCCATCCGGATGACCCAGTCTCCATCCTCATTCTCTGW 

CONS . gTgCCAgaTgtqaCATCcaGaTGACCCAGTCTCCaTcctcccTgTCTGOT 


FR2 CDR2 FR3 

I I j 

012 CAGCTATTTAAArrGGTATOVGCAGAAACCAGGGAAAGCCCCTAAGCTCCTW 

02 CAGCTATTTAAATTGGTATCAGOVGAAACCAGGGAAAGCCCCTA^TCC 

018 CMCTATTTAAATTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAGCTCCT^ 

08 CMCTATTTAMTTGGTATO\GCAGAMCCAGGGAAAGCCCCTAA 

A30 AMTGATTTAGGCTGGTATC^GCAGAMCCAGGGAAAGCCCCTAAGCGCCT^TCTAT 

UL4 CMTTATTTAGCCTGGmCAaAGAAACDWSGGAAAGTCCCTAA^ . 

U CAA HAT rTAGCC TGGTTTCAGCAGAAACCAGGGAAAGCCCCTAAGTCCCTGATCTATGCTGCATCCAGTTTGCA^^ . 

LL5 CAGCTGGTTAGCCTGGTATCAGCAGAAACCAGAGAAAGCCCCTAAGTCCCTGATCTATOT , 

LIS A CAGCTGGTTAGCCTGGTATCAGCAGAAACCAGAGAAAGCCCCTAAGTCCCTGATaATGCTGCATCCAGm . 

W/18a CWjTGCTTTAGCCTGGTATCAGOVGAMCCAGGGAAAGCTCCTAAGCTCCTGATCTATGATGCCTCC^ . 

L5 CAGCTGGrTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAGCTCCTGATCTATGCTGCATC . 

L19 CAGCTGGnAGCCTGGTATCAGCAGAMCCAGGGAAAGCCCCTAA^ 

L23 CAGTTATHAGCCTCKTATCmAAAMCCAGDVAAAfiCCCCTAAK . 

L24 C^nAm/WCCTGGTATCAGCAAAAACCAGGGAAAGCCCCTGAGC . 

LH A^TGATTTAGGCTGGTATCAGCAGAAACCAGGGAAAfiCCCCTAAGCTCCTGATCTATGCTGWTCW 

L12 TAG C TGG TTGGC C TGGTATCAGOU3AAACCAGGGAAAGCCCCTAAGCTCCTGATCTATGATGCCT . 

L12a TAGCTGGTTGGCCTGGTATCAGOUjAMCCAGGGAAAGCCCQ . 

pf- cons ■ cAgctatTTagccTGGTaTCAGCAgAMCCA£ggAAAGccCCTaA£ct^^ 


200 

TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 
TCCCATCAAGGTTCAG 


012a CAGCTATTTAAATTGGTATCAGCAGAMCCAGGGAAAGC^ TCCCATCAAGGTTCAG 


014 • CAGTTATTTAAATTGGTATCGGCAGAAAO 
04 • CAGTTATTTAAATTGGTATCGGCAGAAACL . 
" CAATTATTTAGCCTGGTATCAGCAGAAACCAGGGAAAGT' 


. A20 
Att 
IA 

U8 
L8 

L22 
L9 

CONS-* 


CC TAAGCTCCTGATCTATAGTGCATCCAATTTGCJATCTGGAG . JCCCATCTCGGTTCAG 
CCTAAGCTCCTGATCTATAGTGCATCCAATTTGCAATCTGGAG . TCCCATCTCGGTTCAG 
CCTAAGCTCCTGATCTATGCTGCATCCACTTTGCAATCAGGGG . TCCCATCTCGGTTCAG 
" " — " ~ ' * " "TCAGGGGGTCCCATCTCGGTTCAG 



CAGTTATnAGCCTGGTATCAGCAAAAACCAGGGAAAKCCCTAAGCTCCTWTCTATK 

CAGTAATHAGCCTGGTATCTGCACWUCCAGGGAAATCCCCTAAGCTCnCCTCW 

CAGTTATTTAGCCTGGTATCAGCAAAAACCAGGGAMGCCCCTAAGCTCCTGATCTATGCT^ 

cAgttat TTagccTGgTaTCaGCAgAAACCAGggAAAGccCCTaAGctCcTgcTCTAtgcTGCaTccagTTTgcAaagtGGgG . TCcCATCaaGGTTCAG 


Fig.2e 
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CDR3 

012 TGGCAGTGCWTCTGGGACAGATTTCACTC^ 

02 TGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGTCTGCMCCTGAAGATTnGCAACTTACTACTGTCMCAGACT 

018 TGGAAGTGGATCTGGGACAGATTTTACTnCACCATCAGCAGCCTGCAGCCTGAAGATATTGC^ 

08 TGGAAGTGGATCTGGGACAGAniTACTTTCACCATCAGCAGCCTGCAGCCTG 

A30 CGGCAGTGGATCTGGGACAGMTTCACTCTCACAATCAGCAGCCTGCAGCCTGAAGATTnG^ 

LV* CGGCAGTGGATCTGGGACAGAATTCACTCTCACAATCAGCAGCCTGCAGCCTGAAGATTnGCAACTTATTACTGTCT^ 

LI CGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCMCTTAnA 

L15 CGGCAGTGGATCTGGCttCAGATTTCACTCTCACCATCA^^ 

L15a CGGCAGTGGATCTGGGACAGATTTCACTCTCACWTCAGCAGCCTGCAGCCTGAAGATTHGCMCTTATTACTG^ 

m/lBA CGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGa 

L5 CGGCAGTGGATCTGGGACAGATTTCACTCTttCCAT^ 

L19 CGGCAGTGGATCTGGGACAGATTTCACTCTCACTATCAGCAGCCTGCAGCCTGA^^ 

L23 CGGCAGTGGATCTGGGACGGATTACACTCTCACCATOU^ 

L24 TGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGTTGCCTGCAGTCTGAAGATnTGCMCTTATTACTGTC^ 

Ul CGGCAGTGGATCTGGaWGATTTWCTCTWCCATCAGCAGCCTGCAGCCTQ 

U2 CGGCAGTOWTCTGGGACAGAATTCACTCTCACCATCAGCAGCCTtKAGCCTGATGAT^ 

L12a CGGCAGTGGATCTGGGACAGAAnCACTCTCACCATCAGCAGCCTGCAGCCTGATGAnnGWCTTATT^^ 

pf-cons. cGGcAGTGGATCTGGgACaGAtTtc^CTcTCACcATCAGcaGcCTGCAg 

oi2a tojcagtgcwtctgggacagatttcactctcaccatcago^tctgcmcctgaag^ 

oi4 • tgc^agtggatctg^cagatttcactctcactatovgovgcctgcagcctgaa gatgttgcaacttattacggtcaacggacttacaatgcccc r 


cw« tggcagtggatctgggacagatttcactctcactatcagcagcc 

A20 TGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCC 

AM TGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCC 

m CGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCC 

L18 CGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCC 


TGCAGCCTGAAGATGTTGCAACTTATTACGGTCAACGGACTTACAATGCCCCT 
' GCAGCCTGAAGATGTTGCAACTT ATTACTGTCAAAAGTATAACAGTGCCCC T 
TGCAGCCTGAAGATGTTGCAACTTATTACTGTCAAAAGTATAACAGTGCCCCT 
TGCAGCCTGAAGATTTTGCAAC TTATTACTGTCAAC AGTTTAAT AA T TACCC T 
TGCAGCCTGAAGATTTTGCAACTTATTACTGTCAACAGTTTAATAATTACCCT 
L8 CGGCAGTGGATCTGGGACAGAATTCACTCTCACAATWGCAGCCTGQ^CTGAAGATTTTGCAACn 
L22 TGGCAGGGGATCTGGGACGGATTTCACTCTWCCATCATCAGCCTGAAGCCT^^ 
L9 CGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCTGCCTGCAGTCTGAAGATTTTGCAACTTATTACTGTCMCAGTATTATAGTTAC 
cons. cGGcAGT(X^TCTGGgACaGAtTtcACTcTCACcATCAGcaGcCTG<^gcCTG^ 

Fig.2e Continued 


... . 100 

012 CCCACAGTGTTACAAGTCATAACATAAACC . tccaaggaagcagatgtgtgaggacgagcxaccccagatgctcctcctggtgcctccatctgctgagag 

02 CCCACAGTGTTACAAGTCATAACATAAACC . TCCAAGGAAGCAGATGTGTGAGGACGAGCCACCCCAGATGCTCCTCCTGGTGCCTCCATCTGCTWC^ 

018 CCCACAGTGTAA(^AGTCATAACATAAATWCCCAGGGGAGCAGATGCGTGAGGCTD\GCTGTCC .GCTGCTGAGAA 

08 CCCA(^TGTMCAAGTCATMCATAMraCCCAGGGGAG( 

A30 CCCACAGTGTTACACACCCGAACATAAACC . CCCAGGGAAGCAGATGTGTGAGGCTGGGCTGCCCCAGCTGCTCCTCCTGATGCCTCCATCAGCTGAGAT 

LW CCCACAGTGTTACACACCCGAACATAAACC . CCCAGGGAAGAAGATGTGTGAGGCTGGGCTGCCCCAGCTGCTCCTCCTGATGCCTCCATCGGCTGAGAT 

LI CCCACAGTGTTACACACCCAAACATAAACC . CCCAGGGAAGCAGATGTGTGAGGCTGGGCTGCCCCAGCAGCTTCTCCTGATGCCTCCATCAGCTGAGAG 

L15 CCCACAGTGTTACACACCCAAACATAAACC . CCCAGGGAAGCAGATGTGTGAGGCTGGGCTGCCCCAGCTGCTTCTCCTGATGCCTCCATCAGCTGAGAG 

m/lSA CACACAGTGTTACAAACCCGAACATAAACC . CCCAGGGAAGCAGATGTGTGAGACTGGGCTGCCCCAGCTGCTTCTCCTTATGCCTCCATTGGCTGAGAG 

L5 CCCACAGTGTTACCAACCCGAACATAAACC . CCO\GGGAAGOVGATGTGTG*AGCrGGGCTGCCCCAGCTGCTO 

L19 CCCACAGTGTOCCAACCCGAAC^TAMCC.CCCAGGGAAGCA^ 

L5/19A TCCACAGTGTTACCAACCCGAACATAAACC . CCCAGGGAAGCAGATGTGTGAAGCTGGGCTGCCCCAGCTGCTCCTCCTGATGCCTCCATTGGCTGAGAG 

L23 C CCACAGTGTTAC AC ACCCAAACAAAAACC . CCCAGGGAAAGAGATGTGTGAGGCTGGGCT 

L2<4 CCCACAGTGTTACACACCCGAACAAAAACC.CCCAGGGAAGCAGATGTGTAAGGCTGGGCTO 

L2H a CCCACAGTGTTACACACCCGAACAAAAACC . . CCAGGGAAGCAGATGTGTAAGGCTGGGCTGCCCCAGCTGCTCCTCCTGATTCCTTCATTGCCTGAGAG 

Lll CCCACAGTGTTACACACCCGAACAGAAACC . C C CAGGGAAGCAGATGTGTGAGAC TGGGC TGCCCCAGC TGCTCC TCCTGATGCCTCCA T TGGC TGAGA T 

112 CCCACAGTGTTACACACCCGAACATAAACC . CCCAGGGAAGCAGATGTGTGAGGCAGGGCTGCCCCAGCTGCTCCTCCTGATGCCTCTATCAGCTGAGAG 

L12a CCCACAGTGTTACACACCCGAACATAAACC . CCCAGGGAAGCAGATGTGTGA . GCAGG 
pf-cons . ^ccCACAGTGTtACacacCcgAACAtAAAcC . cCCAgGGaAg<^TGtGT<^6GCTGgGCtgcCCWGctGCtcCTcCTgaTgCCTccatc-gCTGAGAg 

012a CCCACAGTGhACAAGTCGtAACATAAACC . TCCAAGGAAGCAGATGTGTGAGGACGAGCCACCCCAGATGCTC^^ 

0W« CCCACAGTGTTACAAGGCATAAWTAAACCCCCCAAGGAAGCAGATGTATGGGGCTGGCCT^ 

W • CCCACAGTGTTACAAGGCATMCATAAACCCCCCAAGGAAGCAGATGTATGGGGCTGGCCTGCCCCAGA^ 

A20 CCCACTGTGATACAAGCCC<V\ACATAAACCATGGAGGGAAGTAGATGTGTGAGGCTGGGCTGCCCCAGCTGCTCCTCCTGG 

A4 CCCACTGTGATACAAGCCCGMWTAAACWTGGAGGGAAGTAGATGTGTGAGTCTGGGCTGCCCCAGCTGC 

A^a CCCACTGTGATACAAGCCCGAACATAAACCATGGAGGGAAGTAGATGTGTGAGTCTGGGCTGCCCCAGCTGCTCCTCCTGGTGCCGCCGTCTGCTGACAG 

m CACATAGTGTTACAAACCCGAACATAAACC . CCCAGGGAAGCAGATGTGTGAGACTGGGCCGCCCCAGCTGCTTCTCCTGATGCCTCCATTGGCTGAGAG 

L18 CACATAG TGTTACAAACCCGAACA TAAACC . CCCAGGGAAGCAGATGTGTGAGACTGGGCTGCCCCAGCTGCTTCTCCTGATGCCTCCATTGGCTGAGAG 

L8 CCCACAGTGTTACAAACCTGAACATAAACC . CCCAGGGAAGCAGACATGTGAGGCCGGGCTGCCCCAGCTGCTCCTCCTGATTCCTCCATCAGCTGAGAG 

L22 CCCACAGGGTTACMGCCTGMCATMGCCCCTTAGTGAAA 

L9 CCCACAGTGTTACACACCCAAACAAAAACC . CCCAGGGAAGCAAATGTGTGAGGCTGGGCTGCCCCAGCTGCTCCTCCTGATGCCTCCATCGCCTGAGAG 

cons . ccCAcaGtGttACaaacCcgAACAtAAAcC . cccAggGaAgcAgAtgtgTgaggctgggCtgcCCCAGctgCtcCTcCTgaTgCCtccatc-gCTgAgAg 


Fig.2f 
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27 

Oil CTTTTCCACACCACTGCACCCACCAGG 

01 CTTnCCACACCACTGCACCCACCAGG 

A17 TCmCCACgCACTGCACCCACCAGG 

Al TCmCCAC^CAaACACCCACCAGG 

A2 TCmCCACACCACTGCACCCACCAGG 

A19 TCTTTCCACACCACTGCATGCACCAGG 

A3 TCTTTCCACACCACTGCATGCACCAGG 

A23 TATTTCTACACCACTGCACCCACCAGG 

pf-cons. tcTTTCcACaCCACTgCAccCACCAGG 

A18 TCTTTCCACACCACTGCACCCACCAGG 

A7» TATTTCTATACCACTGCACCCACCAGG 

A21 GCTTAGCAT . . CACTGCAC . CACCGGG 

A29 TCmCCACCCCAaG^CCWCCAGG 

A13 TCmCCACCCCAaC O^CC ACCAGG 

UL3 TATTTCTTCACTGCTTCACTCACCAGG 

cons. tcTTtccacaccaCTgCAccCACCaGG 


Fig. 3. Alignment of the sequences of the V X II gene regions. 
The presentation is analogous to Fig. 2. 

a) The conserved part of the region upstream of dc. Due to 
many insertions and deletions sequences further upstream are 
difficult to align. The last nucleotides of this region can be used 
together with the dc box to construct a primer with V X II plus 
V X III preference. 

b) Sequences upstream of the leader region. One possible 
primer is shown. 

c) Exon t For a possible primer sequence see Fig. 2c. 

d) 5' and 3' parts of the introns. Since the large middle section 
of the introns is rather divergent an alignment seems not to be 
meaningful. For a possible primer sequence see Fig. 2d. 

e) Exon 2. For primers see Fig. 2e. 

f) Where known, the first 100 bp downstream of exon 2 are 
shown. The hepta- and nonanucleotide recognition sites are un- 
derlined. For primers see Fig. 2f. 


) Fig. 3a 


dc 


Oil 
01 

A17 
Al 
A2 

A19 
A3 

A23 

PF-CONS . 

A18 
A21 

A7» 
A29 
A13 
L13 

CONS. 

Fig. 3b 


3GATTTGCAMTTGTCCCACAGQGAGGACC . . TTCCCTTGTGAGTCTC AG VTAA W .GCT CAGCTGT AAC . TGTGCCnGACTGATCAGGACTCCTCAGTTCACCTTCTCACA 
GGATTTGCAT ATTGTCCCACAGGGAGGACC . . TTCCCTTGTGAGTCTC AG VTAA \A. GCT CAGCTGTI aAC . TGTaCTTGACTGATCAa^CaCACfrTCACOTaCAU 


3GATTTGCA1 AU CTCCCCAOSCi AGGACC . . ncCCTTGCAAGTCTGAGftTAA VA . GCTCAGCACCAAC . CTTGAC TTMCTAAHAGGACTCCTCAGGTCACCTTCTCACA 


TGATTTGCAT ATTgT( 


113 

rr^mG(^TiAnGTCCCCTAGG5TGGACCCnCCCCnOTGAGTCTC|^ 
rGATTTGCAT ATTGTCCCCTACCGTGGACCCTTCCCTTTGTGAGTCTG VG VTAA W VGCTCAGCTCTATC . CnGCCTTGACTC^TCAC^CTCCTCAGTTCACCTTCTCACC 
rGATTTGCAT ATTATCCCTTAGTGAAGAC . . . TTTCCTTGTGAGTCTO \G VTAA VA . GCTCAGCTCTACC . CTTGCCTTGACTGATCAGGACTCCTCACnTCACCTTCTCACA 
rGATTTGCAT ATTATCCCTTGGTGAAGAC . . . TTTCCTTG/GAGTCTG VG VTAA VA . GCTCAGCTCTAAC . CnGCCTTGACTGATCAGGACTCCTCAGTTCACCTTCTCACA 
TGATTTGCAT ATTGTCCCCTAGGGAGGACC. . TTCCCTTGTGAGCCTO \G VTAA VA , GCTCAGCTCTAAC . CTTGCCTTGACTGATCAGGACTTCTCAGTTCATCTTCTCACC 


AP2 
I 


GATA1 
TATA 


Tgr pcco 

TcTCCC-Ti 


TGATTTGCAT ATTGTCCCCTAGGGAGGACC . . TTCCCTTGTGAGCCTC AGATAA 
GGATTTGCAT ATTGTCCTCTAGGGAGGACC. .nCJ^CTTCTGCOTCTC AGATAA 


E2A 
.1 


aGgGagGACc . . TTcCcTTGTGAGTCTqAC^TAAj\Aj.GCTCAGC^rcTAAC . CTTGcCrraACTaATcAGGACTcCTCAGTTCAcCTTCTCACA 


VA .GCT IAGCTCtJaaC . CnaCTTOCTGATCAC^naCAGnCATCTTCTCACC 
VA . GCT PAGCTgI aACCC HACCTTGAC TGATCAg^TCCTCAGTTCACCTTCTCACT 


GGATTTGCAT ATTGT^CC^^GGACC . . TTCCCJTGCAAGTCTC AG ATAA VA . GCTCAGCACCAAC . CTTGACTTGAC TAAHAGGACTCCTCAGCfrCACCTTCTCACA 
GGATTTGCAT ACTGTCCCCTAGGGAGGACC , , TTCCATTGTGAGTCTC *G *TAA VA , GCTCAGCTGTAAC . CTTGCCnGACTGATCAGAACTCCTCAGCTCACCTTCTCACA 
GGATTTGCAT ACTGTCCCCTAGGGAGGACC . . TTCCATTGTGAGTCTC *C ATAA VA , GCTCAGCTGTAAC . CHGCCTTGACTGATCIAGAACTCCTCAGCTCACCTTCTCACA 
GGATTTGCAT 3TTGT CCCCTAGGGAAGACC . . TTCCCTTGTGAGTCTC Vc UAA VA , GCTCAGCTCTAAC . CTTGCCTTGACTGATCAGGACTCCTCAGTTCACCTTCTCACA 
sGATTTGCAT ^tTgTCCcctaGgGagGACc . . TtcCcTTGtoaGtCTC AC WVA VA , GCTCAGCtctAaC . ctTgcCTTgACTgATcAGgACTcCTCAGtTCAcCTTCTCACa 


L 

I . 19 

Oil ATGAfJGCTCCCTC<TCAOTTCCTGGGGCTGaAATGCTCTGGGTCCCTG 
01 ATGA(^TCCCTaT(^TCCTGGGGCTGCTAATGCTCTGGGTCCCTG 
A17 ATGAGGXTCCCTGCTCAOTCCTGGXjGCTGCTMTGCTCTGGGTCCCAG 
Al ATGAGGXTCCCTGCTCAGCTCCTGGGGCTGCTMTGCTCTGGGTCCCAG 
A2 ATGAGGCTCCCTGCTOVGCTCCTC3GGGCTGaMTG£TCTGC»TACCT6 
• A19 ATGAGGXITCCCTKTCAGXITCCTGGGGCTGCTAATGCTCTGGGTCTCTG 
A3 ATGAGGCTCCCTGCTCAGX;TCCTGI3GGXTGCTMTGX1TCTGGGTCTCT6 
A23 ATGAG^TCCTTGCTCArxnCTGTJGKTGCTMTG£TCTGGGTCCCTG 
pf-cons . ATGAGGCTCCcTGCTCAGCTcCTGGGGCTGXijT^TGCTCTG 


A18 ATGAGGCTCCnGCTWGCTCCTGGGGXTGCTAATGCTCTGGATCCCTG 

A2i atga(3Gx:tccctgctc^tcttggggctgx:tmt«:tctgggtccctg 

A7* atgaggctccttgctovgtjttctggggctgctaatgctctgggtccctg 

A29 ATAAGGGTCCCTCiCTCAGCTCCTGGTiG^TGCTMTG^TTTGGGTTCCTG 

A13 ATAAGGTTCCCTGCTWCKITCCTGGC^TGCTMTGCTTTGTJGTTCCTG 

L13 GTGAGGCTTCCCTGCTCAGCTCCTGGGGCTGCTAATGCTCTGGGTCCCTG 

cons . aTgAGGc TCCc TGC TCAGCTccTGGGGCTGCTAATGCTcTGGgTccCtG 


Fig. 3c 
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GTAAGGACAGAAA . GA3ATGAGGGAGGACAACTGGGTGGGA , 6 . . , 
GTAAGGACAGAAA . GA3A TGAGGGAGGACAAC TGGGTGGGA . G . . . 
GTAAGGGTA6AAGGGA3ATGAGGGAGGAGAATGGCATGGAACG . . . 
GTAAGGGTAGAAGG&GATGAGGGAGGAG4ATGGCATGGAACG . . . 
GTAAGGATGGAAG . GA3ATGAGGGAGGAGGAGGGGGTGGGAAG . . . 
GTM6AAAAGAAG , GA2ATGAGGAAGGAGAATAGGGTGGGAGG . . . 
GTAAGAAAAGAAG . GA3A TGAGGAAGGAGAATAGGGTGGGAGG . . . 
GTGAGGACA6AAGAG1GATGAGGGAGGAGAATGGGGTGGGAGG . . . 
GTaAGga-aGAAg . GASATGAGGgAGGAgaAt-GggTGggA-G . . . 


GTAAGGATGGAAG . &GATGAGGGAGGAGGAGGGGGTGGGAGG . . . 
GTAAGGACAGAAG.GWTGAGGGAGGAGAATGGGGTGGGAAG. . . 
GTGK3GACAGAAGAGA3ATGAGGGAGGAGWGGGGTGGGAGG. . . 
GTA^GGACAGAAGGGAEATAAGGGAGGAGAATGGAGTGTGAGG . . . 
GTAAGGACAGAAGGGAzATAAGGGAGGAGAATGGAGTGTGAGG . . . 
GTa/CgacaGAAg.GA5ATgAGGgAGGAgaAtgGgqTGggAgG. . . 


. . .TATCTATATAAACTTGCACTTCTCTGTTATTATTTCAG 
. . . TATCTATATAAACTTGCACTTCTCTGTTATTATTTCAG 
. . .TTTATACATAACCTTGCTCTTCTTTCTATTTATTTCAG- 
. . .TTTATACATAACCTTGCTCTTCTTTCTATTTATTTCAG 
. . . CTTTCACATAACCTTGCACTTCTCTCTCATAATTTCAG 

. . . TTTGTACATAACTTTGCAATT CATTATTTCAG 

. . . TTTGTACATAACTTTGCAATT CATTATTTCAG 

. . . TTTCTACATAACCTTGCACTTCTCTCTCATTATTTCAG 
. . ,tt T-tAcATAAccTTGCacTT^ct -tct"aTtATTTCAG 

. . .CmCAWTMCCTTGCACnCTCTCTCATAATTTCAG 
. . ,ATTGTACATATCCTTCCACTTCTTTCTATTTATTTTAG 
. , . TTTCTACATAACCTTGCACTTCTCTCTCATTATTTCAG 
. . . TTTGTACATAACTTTGCACTT . . CTCTCACTATTTCAG 
. .J.TTTGTACATAACCTTGCACTT. .CTCTCACTACTTCAG 
. . .ttT-tAcATAaccTTgCacTTctctctcattAtTTcAG 


Fig. 3d 


L' FR1 CDR1 

I I I 

f .1 . . . . . I ,105 


Oil GATCCATTGAGGATATTGTGATGACCWGACTCCACTCTCCCTGCCCGTCACCCCTGGAGAGCCC^CTCCATCTCC^ 


01 GATCCA37GAGGATATTGTGATGACCCAGACTCCACTCTCCCTGCCCGTCACCCCTGGA^^ 
A17 GATCCA7TGGGGA TGT TGTGATGAC TCAGTCT C CACTCTCCCTGCCCGTCACCC TTGGACAGC CGGCCTCCATCTCC" ' " 


Al GATC CA7TGGGGATG T TGTGATGAC TCAGTCTCCAC TCTCC CTGCCC6TCACCCTTGGACAGCCGGCCTCCATCTCC 

A2 GATCCAJTGCAGATATTGTGATGACCCAGACTCCACTCTCTCTGTCCGTCACCCCT^ . . . 

A19 GATCCAJ7GGGGATA TTGTGA TGACT WGTCTCCACTCTCCCTGC CCG TCACCCCTGGAGAGCCGGCCTCCATC TCCTGC^ 

A3 GATCCA~GGGGATATTGTGUGACTCAGTCTCCACTCTCCCTGCCCGTC 

A23 GATCCAJTCXjGGATATTGTGATGACCCAGACTCCACTCTCCTCACCTGTCACCCTTGGACAGCCGGCCTC 

pf-zjcns. GATCCATGggGATaTTGTWTGACM^-CTCWCTCTCc^^ 


GCAGGTC T AGTCAGAGCCTC TTGGA TA 


gcaggtctagtcaaagcctcgtataca 
gcaggtctagtcaaagcctcgtataca 

GCAAGTCTAGTCAGAGCCTCCTGCATA 


011a« GCCTCCATCTCCTGCAGGTCTAGTCAGAGCCTCTTGGATA 
A18 GATCCATGCGGATATTGTGATtaCCCAGACTCCACTCTCTCTC^^ 

A21 GATCCAifTGCAGAGA T TGTGATGACCCAGACTC CACTC TCCTTGTC TATCACCCCTGGAGAGCAGGCC TCCAT6TCC TGCAGGTCTAGTCAGAGC CTCCTGCAT A 
A7* GA TCCAJ7GGGGAT A TTGTGA TGACCCAGACTCCAC TCTCCTCGCC TGTCACCCTTGGACAGCCGGCCTCCATC TCCTTCAGGTC TAGTCAAAGCC TCGT ACACA 
A29 GATCCAF7GGAGATATTGTGATGACCCAGACTCCACCCTCCCTGCCCGTCMCCCTGGAGAGCCGGCCTCCATCTCTTGCAGGTCTAGT 
A13 GATCCAiiTGGAGATATTGTGATGACCCAGACTCCACCCTCCCTK 
UL3 GATCC^CEGGATATTGTGATGACCCAGCATCTGaCTCCCTra 
cons. GATCCA^GggGAtaTTGTGATGACcC^acTCo^ 

FR2 CDR2 FR3 

I II 

Oil GTGATGiTGGAAACACCTATTTGGACTCEtAOT 

01 GTGAT&iTGGAAACACCTATnGGACTGGTACCTGCAGA.AGCCAGGGCAGTCTCCACAGCTCCT^ 

A17 GTGAT . . GGAMCACCTACTTGAATTGGTTTO^GCAGA. GGCCAGGCCAATCTCC 

Al GTGAT. ..GGAMCACCTACTTGAATTGGTTTCAGOVGA.GGCCAGGCCMTCTCCAAGGCGCCT 

A2 GTGAT . - . GGAAAGACCTATTTGTATTGGTACCTGCAGA . AGCCAGGCCAGCCTCCACAGCTCCTGATCTATGAAGTTTCCAACCGGTTCTCTGGAGTGCCAGAT 

A19 GTAAT . - . GGATACMCTATnGGATTGGTACCTGCAGA , AGCCAGGGCAGTCTCW(^GCTCC^ 

A3 GTAAT . . . GGATACMCTAmGGATTGGTACaGCAC^. AGCOVGGGCAGTCTCW 

A23 GTGAT . „ . GGAAACACCTACTTGAGTTGGCTTCAGCAGA . GGCCAGGCCAGCCTCCAAGACTCrTMTTTATAAGATTTCTAACCGGTTCTCTGGGGTCCCAGAC 

pf-ccns . GTcAT . - . GGAAAcAcCTATTTG-ATTGGTAcCTtX^ . AGCCAGfr 

011a» GTGAT&iiO^CACaATTTGGATTGTTACCTGCAGA^ 
A18 GTGAT . , . GGAAAGACCTATTTGTATTGGTACCTGCAGA . AGCCAGGCCAGTCTCCACAGCTCCTGATCTATGAAGT^CCAGCCGGTTCTCT 
A21- GTGAT . . . GGATACACCTArrTGTATTGGTTTCTGCAGAAAGCCAGGCCAGTCTCCAOUjCTCCTGATCTATGAA^ 

A7« GTGAT . . . GGAAACACCTACTTGAGTTGGCTTCAGCAGA . GGCCAGGCCAGCCTCCAAGACTCCTMTTTATAAGGTnCTMCCGGTTCTCTGGGGTCCCAGAC 
A29 GTAAT . . GGATATACCTATTTGCATTGGTACCTGCAGA. AGCCAGGGCA^ 

A13 GTAAT . . . GGAT A T ACC TATT TGCATTGGTAC CCGCAGA . AGCCAGGGCMTCTCCACAGCTCCTGATTTATAGGGTT^ 
L13 GTGAT . . . GGAAACACCTATTTGGATTGGTACCTGCAGA . AGCCAGGCCAGTCTCCAWGCTTCTTATCTACACMT^ 
cons . GTgAT , . . GGAaAcAcCTAtTTG-atTGgtacCtGCAGA . aGCCAGGcCAqtCTCMcagCtcCT^TcTAta^gtTTCc^^ 

CDR3 
I 

. I . 315 

Oil AGGTTCiGTGGCAGTGGGTCAGGCACTGAmCACACTGAAMTCA^ 

01 AGGnC^GGCAGTGGGTCAGG(^CTGATTTCACACTGAAMTCAGCAG(^ 
A17 AGATTt^GGCAGTGGGTCAGGCACTGATTTCACACTGAAMTOC 

Al AGATTCAXGGCAGTGGGTC^GGCACTGATTTCACACTGAAMTCAGOVGGGTG^ 

A2 AGGTTCAJTGGWGCGGGTCAGraCAGATnCACACTGAAAATCAGCCGGGTG^ 
A19 AGGnCC-TGGCAGTGGATC/GGCACAGATnTACACTGAAMTCAGCAGAGTGGAGGCTGA^ 

A3 AGGnCATTGGCAGTGGATCXGOUIAGAT^ 
A23 ^^nC^TGGCAGTGGGGC^GGGACAGATnCACACTGAAAATCAGCAGGGTGGAAGC 
pf-:ons . AGgTTCAjtGGCAGtGGgtWGGcAC-GATTTcACACTGAAMTCAGCaGgGTGGAgGCT 

ZllA* AGGTTCCTWCAGTGGGTO^GGCACTGAtTTCACACTGAAMTCAGCAGGGTGGAGK 

A18 AGGTTCATTGGCAGCGGGTCAGGGACAGATTTCACACTGAAMTCAGCCGGGTGGAGGCTGAGGATGn 

A21 .AGGTTC^jTGGCAGCGGGTC^GGGAOUSATTTCACACTGAAAATCAGCCGGGTGGAGrc 

A7» AGATTC^TTGGCAGTGGGGC^GGGACAGATTTCACACTGAAAATCAGCAGGGTGGAAGCTGAGGATGTCGGC^ 

A29 AGG T TT iGTGGCAG TGGGT GGTAGTGATTTCAC ACTGAAAATCAGCTGGGTGGAGGCTGAGGA TGTTGGGGTT TA TTAC TGCATGCAAGCTACACAGTTTC C T 

413 AGGTTT^TGGCAGTGGGTar^(^TGATTTCACACTGAAW\TCAGCTGGGTGGAGGCTGAGGATGTT 

U3 aSSttc^tggcagcaggtc^gggacaggtttcacacttaaattcagcaaagtggaggctgaggatgt 

:cns, AGGTTci:-rGG(^TGGGTUGGcACAGATTTcACACTGAAMTCAGCAGGGTGGA(^Cra 
Fig.?* 
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Oil TCCACAGTGGTACAGCCCTGAACAGAAACCTCCCTGCTGT . GGTGCCCCAGC TGCTCACATGCACTGCTTGTCTGGGGftGCAGGTCAGCAGCGTCTCTGA 

A17 CCW^TOTACAGCCCTGMCA^ 

Al CCCACAGTGGTACAGCCCTGAACAAAMCCTCCCTG.TGG.GGTGGCCCAGCTGCC^ 

A2 CCttt^TGGTACAGACCAATACAGAMCm^ 

A19 CCCACAGTOSTACMCCCCTMCAGAMCCTCCJCCTGG.GGTTGCCTAGnGCTCA^ 

A3 CCCACAGTGGTACAACCCCTAACAGAAACCTCC . TCCTGG . GGTTGCCCAGTTGCTCACATGTGCTGCTTGTCTGGAGAGCAGCTCAGCAGGGTCTCTGA 

A23 CACACAGTGGTACAGCTCTGAACAAAAACCTCC « CGCTGG . AGTGGCCCAGGGCG ^ _ r ^ m „^ Mf . „ ^ 

r-coNS. ^ccCACAGTGGTACAgccCtgM(^gAMCCTCC ctgcTGq.gGTggCCcAGctqctCACatGtgcTGC 

011a* TCCACAGTGGTAWGCCCTGAACAGAMCCTCCCTGCTGt.ffiTGCCCCAGCTGC^ 
A18 CCCACAGTGGTACAGACCAATACAGAAACCTCCCTGCTGG.GGTGTC 
A21 CCWCAGTKTACAGCCCTGAGCACAMCCTCTC^ 

A7» CACACAGTGGTACAGCCCTGAACAAAAACCTCC . CGCTGG . AGTGGCCCAGCTGCTCAAGTGTGTTGTTTCTC 
A29 MCACAGTGGTAWCCCTGAACAGAMCCTCCCTTCn^ 

A13 AACACAGTGGTACAACCCTGAACAGAAACCTCCCTTCTTGCTGTGGTTCAGCTGCCCAAATG 

L13 CACACAGTGGTAAMCCCTGMCACAMCCTCCCTACTTGGGATGGCCCAGCCATCCAC^GTGn „ _ _ _ 

cons* ccCACAG1^TAcAgccCt(^aCAgAA^TCcctgcTgg.ggTggccc^ 


100 


Fig.3f 


84 

A27A GATCACCTGGGCA. . . TGGGC TGCTGAGAGCAGAAAGG . GGAGGCAGATTGTCTCTGCAGC TGCAAGCCCAGCACCCGCC . . .CCAGC 

AIIa GATCACCTGGGCA. . . TGGGC TGCTGAGTGCAGAGAGG. GGAGGCAGATGGTCTC 

L2 GACMCOIGGGCGGCGCAGGATGCT^TGO^^ 

U6a GACMCCAGGGCGGCGCAGGATGCTCAGTGC^ ., .CCAGC 

L6 CTGCAGCTGGAAGCTCAGCTCCCA.ee . . . .CAGC 

L25 GACAACCAAGGTGGCGCAGGATGCTCAGTGCAGAGAGGAA . . . GCAGGTGGTATCTGCAGCTGGAAGCCCAGCTCCCACp ♦ . . CCAGC 

pf-cons . GAcaACCagGGcggcgcaGGaTGCTcAGtGCAGAaAGGaaga . G(^3gTgGTcTcTGcAgCTGgAAGC-CAGCtCCCaCC . . . cCAGC 

L10 ' CTO^TGQW^TCAGCtcCCACC . . v jCAGC 

cons. GAcaACCagCScggcgcaGGaTGCTcAGtG^ .cCAGC 

Fig. 4a 

Fig . 4 . Alignment of the sequences of V x Ul gene regions . 
The presentation is analogous to Fig. 2. 

a) Sequences upstream of the dc box. For a possible primer see Fig. 3a. 

b) Sequences upstream of the leader region. One possible primer is shown. 

c) The exon 1 sequences are shown together with 8bp of upstream sequence in order to include the alternative start codon of 
L25^L For a possible primer see Fig. 2c. 

d) Intron. For the possible primer see Fig. 2d. 

e) Exon 2. For possible primers see Fig. 2e. 

f) 100 bp downstream of exon 2. For primers see Fig. 2f. 
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dc 


TGCTTTGCAiljTCCcfeCAGCCdCCCTGCAGTCCAGAGCCckTATCAAte 

TGCTTTGCAT ST CCCjECCAGCCC CCCTCCAGTCCAGAGCCC UATCAA7 SCCTGGGTDVGaIsCTCTGGGGAG^ 


TGCTTTGCAT 3TCCCJCC IAGCTC CCCTACCTTCCAGAGCCC VTATCAAT 3CCTGGGT* 


TGCTTTGCAT 3TCCCTCC IAGCTC XCTACCTTCCAGGGCCC UATCAAT jCCTGGGTCAGAGCTCTGGGGAGGAACTGCT 


TGCTTTGCAT 3TCCCTCC 


TGCTTTGCAT 3TCCCTCC 


AP2 
j E2A 


TATA 


AP2 


MYB 


TCAGaIjCT 


.1 


TCTGGGGAGGAACTGCT 3AGTTA 3GACCCAGACGGAACC 


t^VGCT(£CCTACCTTCCAGA£CCChTATCMTfa iLAGTTA jGACCCAGAGGGAACC 

CTG CCCTACCTTCCAGAGCCC \TATCAAT jCCTGGGTCAGA GCTCTGGGG AGGAACTGCT IAGTTA 3GACCCAGACGGAACC 
TGCTTTGCATpTCCCTCCpAGCTG XCTACCnCCAGGGCCC WATCAA7 XCTGGGTCAGA DCCCTGGGb AGGAACTGCT ZAGTTA 3GACCCAGATGGAACC 
:AGCt4^aCctTCCAGaGCCC^ 


TGCTTTGCAT 3TCCCTCC IAGCTC XCTACCHCCAGGGCCC AJATCAAT 3CCTGGGTCAG* GCTCTGGGGAGGAACTGCT CAGTTA 3GACCCAGACGGAACC 
TGCTTTGCATfeTCCaCC IAGCTG XCTACCTTCCAGAGCCC A.TATCAAT 3CCTGGGTCAG ^GCCCTGG(j AAGGAACTGCT CAGTT^ 3GACCCAGACGGAACC 


:agtta 3gacccagacggaacc 


TGCTTTGCAT 3TCCCTCC :AGCtC XCTaCctTCCAGaGCCC IATATCMTI gCCT^ 


Fig. 4b 


L 



A27 AGGGAACCATGGAAACCCCAGCGCAGCTTCT . CTTCCTCCTGCTACTCTGGCTCCCAG 

All aGGGAACCATGGAAACCCCAGCGCAGC TTCT . CTTCCTCCTGCTACTCTGGCTCCCAG 

L2 ACGGAACCATGGAAGCCCCAGCGCAGCTTCT . CTTCCTCCTGCTACTCTGGCTCCCAG 

116 ' AO^CWTCZiAAGCCCCAGCGCAGCTTCT. CTTCCTCCTGCTACTCTGGCTCCCAG 

L16c ACGGAACCATfeGAAGCCCCAGCGCAGCTTCT . CTTCCTCCTGCTACTCTGGCTCCCAG 
UO ACGGAACCATGGAAGCCCCAGCTCAGCTTCT , CTTCCTCCTGCTACTCTGGCTCCCAG 
cons. A-GGAACCATGGAAgCCCCAGCgCAGCTTCT . c nCCTCCTGCTACTCTGGCTC^C AG 

Fig. 4c 


100 

A27 GTGAGGGGAACATGG3ATGGTTTTGCATGTCAGTGAAAACCCTC TCAAGTCCTGTTACCTGGCAACTCTGCTCAGTCAATACAA . TAATTAAAGCTCAATAT 

All GTGAGGGGMCATGGGATGGTTTTGCATGTCAGTGAAAACCCTCTCAAGTCCTGTTAC^ 

L2 GTGftGGGGAATATGAGGTGGTTTTGCACATCAGTGAAAAC TCCTGCCACCT CTGCTCAG . CAAGAAATATAATTAAAAT TCAA TG T 

Q6 GTGAGGGGAA TA TGAGGTGGT TTTGCACATCAGTGAAAAC TCCTGCCACCT CTGCTCAG . CAAGAAATATAATTAAAATTCAATGT 

L6 GTGAGGGGAACATGAGGTGGTTTTGCACATTAGTGAAAAC TCTTGCCACCT CTGCTCAG . CAAGAMTATMTTAAAAnCAAAGT 

L20 GTGAGGGGAA TATGAGGTGTCTTTGCACAT CAGTGAAAAC TCCTGCCACa CTGCTCAG. CAAGAAATATAATTAAAATTCAAAAT 

L25 GTGAGGGGAA TATGAGGTGGTTTTGCACATCAGTGAAAAC TCCATCAGGAGTTTTCT . CTGCTCAG . CAAGAAATATAATTAAAATTCAAAGT 

pf-cons . GTGAGGGGAAtATGaGqTGgtTTTGCAc^Tc>\GTGAAAAC TCctgccacct CTGCTcAg , CAAqAAAtaTAATTAAAatTCAAtcT 

L16C GTGAGGGGaATATGW^GGTTTTGCACATCAGTGAAAAC TCCTGCCACCT CTGCTCAG . CAAGAAATATAATTAAMTTWTGT 

LLO GTGAGGGGAACATG^GGTGGTTTTGCACATCAGTGAAAAC TCCTGCCACCT CTGCT(^.CAAGAMTATAAnA^TT^TGT 

cons . GTGAGGGGAAtA TGaGg TGgtTTTGCAcaTcAGTGAAAAC TCctgccacct CTGCTcAg . CAAgAAAtaTAATTAAAatTCAAtgT 

196 

A27 AAAGC AA T AAT TCTGGCTCTT CTGGGAAGACAATGGGTTTGATTT A6AT TACATGGGTGACTTnC TGTTTTAnTCf^TCTCAG 

All AAAGCAATMTTCT{^TCTTCTGGGAAGACAGTGGG 

L2 AGATCAACAATT T TG3C T CTACTT . AAAGACAGTGGGTTTGATTTTGATTACATGAGTG . CATnCTGTT^ATTTCCAATTTCAG 

L16 AGATCAACAATT TTGGCTCTACTT . AAAGACAGTGGGnTGATTTTGAnACATGAGTG. CAnTCTCTTnAmc WTTTCAG 

L6 ATA TC AAC AATTTTGGCTCTACTC . AAAGACAGTTGGTTTGATCTTGATTACATGAGTG , MTTTCTGTTTTATTTCCAATTTCAG 

L20 AGATCAACAATT TT&3CTCTACTC . AAAGACAGTGGGTTTGATTTTGATTACATGAGTG . CATTTCTGTTTTATTTCCAATTTCAG 

L25 AGATGAACAATT TTG3CTCTACTC , AAAGACAGCTGGTTTGATCTAGATTACATGAGTG . CATTTCTOTTTTATTTCCAATCTCAG 

pf-cons. AgAtcaAcAATT tTGGCTCTaCT- . aAAGACAgTgGGTTTGATtTtGATTACATGaGTG . CaTTTCTgTTTTATTTCCAATtTCAG 

.♦»•••• ISo 

L16c agatcaacamttt&:<tctactt>aaagacagtgggttt^^ 

lio agatc aacaattttgjCTCtactc . aaagacagctggtttgatcta^ttacatgagtg . catttctgttttatttccaatctt . G 

cons. AoAtcMc*ATTtTG5CTCTaCT-.aAAGACAgt^ 


Fig.4d 
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L' FR1 CDR1 

II 1 

. . . . . . .1 • ioo 

A27 ATACCACCGGAGAAATTGTGnGACGCACT 

All ATACCACCGGAGAAATTGTGTT6ACGCAGTCTCCAGCCACCCTGTCTTTGTCTCCAGGGGAAA6AGCCACCCTCTCC 

L2 ATACWCTGGAGAAATAGTGATGACGCAGTCTCCAGCWCCCTGTCTGTGTCTCCAGGGGAAAGAGCCACCCTC^ 

U6- ATACCACT^GAMTAGTGATGACGCAGTCTCCAGCCACCCTGTCTGT 

L6 ATACCACCGGAGAAATTGTGTTWCACAGTCTCCAGCCACCCTGTCTTTGTCTCCAGGGGAAAGAG^ 

L20 ATACCACCGGAGAAAnGTGnGACACAGTCTCCAGCCACCCTGTCTTTGTCTCCAGGGGAA 

L25 ATACWCCGGAGAMTTGTAATGACACAGTCTCCAGCCACCCTGTCTT^ 

cons? ATAC&CcGGA^TtGTgtTGACgC^ 

L16c ATACCACTGGAGAAATAGTGATGATGCAGtcrCCAGCCACCCTGTCTGTOTCTCCAGGGG^ 

FR2 CDR2 FR3 

! . . . .1 I 200 

A27 CAGCAGCTACTTAGCCTC&TACCAGCAGAMCCTGGCCAG 

All CAGCAKTACTTAGCCTGGTACCAGCAGAAACCTGGCCTGGCGCCCAGGCTCCTW 

L2 CYTT^CAACTTAGCCTGGTACCAGCAGAAACCTGK 

U6 C . . . AGCAACTTAGCC TGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATffi • 

16 C . . .AGCTACTTAGCCTGGTACCMCAGAMCCTGGCCAGGCTCCCAGGCTC^ 

120 C . . . AGCTACTTAGCCTGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCT 

L25 CAGCAGCTACTTATCCTGGTACCAGCAGAAACCTGGGCAGGCTCCCAG^ 

cons . C . . . AGCtACTTAgCCTGGTACCAgCAGAAACCTGGc(>GGCtCCCAGK 

L16c C . . . AGCMCTTAGCCTGGtACCAGCAGA^ 

L10 CAGCAGCTACTTAACCTGGTATCAGOU3AAACCTGGCCAGGCGCCCAGGCTCCTCATCTATGOT 

cons . C . . . AGCtACTTAgCCTGGTAcCAgCAGAAACCTGGcCaGGCtCCCAGGCTCCTCATCTATG^ 

CDR3 

. . . i; 299 

A27 AGTGGCAGTGGGTCTGGGACAGACTTCACTCT^ 

Ail AGTGGCAGTGCGTCTGGGAWGACT™ 

L2 AGTGGCAGTGGGTCTGGGACAGAGTTCACTCTCACCATWGCAGCCTGCAGTCT^ 

U6 AGTGGCAGTGGGTCTGGGACAGAGrTCACTCTCA^ 

L6 AGTGGCWTGGGTCTGGGACAGACTTCACTCTCACCATCAGCAGCCTAGAGC^ 

L20 AGTGGCAGTGGGCCTGGGAWGACTTCACTCTC^ 

L2S AGTGGCAGTGGGTCTGGGACAGACTTCACTCTCACWTCAGCAGCCTGCAGCCTG^ 

cons. ACTGGCAGTGGGtCTGGGACAGAc^ 

L16C AGTGGCAGTOTTCTGGGACAGAGTTCACfCTWCCATCAGCAGCaGCAGTCT 

L10 AGTGGCAGTGGGTCTGGGACAGACTTCACTCTCACCATC^ 

cons * AGTGGCAGTGGGtCTGG£^(^cTTCACTCTCAC(^TCAGCAGcCTgcA^ 

Fig.4e 
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A27a cccacagtgattcagcttgaaacaaaaacctctgcaagaccttcattgtttactagatta. , . TACCAGCTGGTTCCTTTACAGATAGCTGCTGCAATGA 

AllA CCCACAGTGATTO^nGAAACAAAAACCTCTGCAAGACCTTCATTGTTTACTAGATTA. . . TACCAGCTGCTTCCTTTACAGATAGCTGCTGCAATGA 

L2 CCCACAGTGATTCAACATGAAACAAAAACCTCAAGAAGACCATCAGTGTTTACTAGATTA. . . TACCAGCTGCTTCCTTTACAGACAGCTAGTGTGGTGG 

U6 CCCACAGTGATTWCATGAAACAAAAACCTCAAGAAGACCATCAGTG^ 

L20 CCCACAGTGAnCCACATGAAACAAAAACCCCMCAAGACCATCAGTGTnACTAGATTATTATACCAG 

L25 CCCACAGTGATTCAACATGAAACAAAAACCTCAACAAGACCATCAGTGTTTACTAGATTT . . . TACCAGCTGCTTCCTTTACAGACAGCTAATGTGG rGG 

pf-cons . CC CACAGTGA TTCaaCaTGAA ACAAAAACCt CaacAAGACCaTCAgTGTTTACTAGATTa . . . TACCAGCTGCTTCCTTTACAGAcAGCTa-TG-ggTGg 

UO CCCA(^TWTTCAAaTGAMCAAAAACCTCMCAAGACW 

cons. ^CCCACAGTGATTCaaCaTGAAA(>AAAACCt (>acAAGA . .TACCAGC7GCTTCCTTTACAGAcAGCTa-TG-ggTGg 

Fig.4f 
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Fig. 5. Alignment of the formal translation products of the potentially functional and slightly defective V x genes of subgroups I -III 
(a-c). 

The numbering of the residues and the definition of invariant residues (underlined) are as in ref . [28} . The four highly conserved re- 
sidues (see text) are underlined twice. In panel (d) the residues conserved in the sequences of the formal translation products of 
subgroup I-IVgenes are shown. 
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Addendum to Table 1 


a) Cox, Tomlinson and Winter amplified from genomic DNA by PCR 
FR1-CDR3 sequences and called four of them "new gene segments" 
(Eur. J. Immunol. 24, 827-836, 1994). However, the sequences do 
not correspond to new gene loci but have to be considered as 
alleles of published V K genes and orphons, as far as this can be 
concluded in the absence of intron sequences and data on the 
genomic context. 

DPK14, A21/A5; DPK37, Z3; LFVK5, V118; LFVK431, LI. 

The comparisons are described in Klein and Zachau, Ann. NY Acad. 
Sci. , in press. 

b) The reference for the gene Alia is [54]. 

c) Alternative accession numbers 

for Z2, Z3, and Z4 are S37418/19/21. 

Corrigendum of Table 3 

Subgroup III: intron length [bp] 168-1446; 

similarity [%] 64-100. 


Update of the list of references 

4 (1994) 2. Edition, in press. 

6 (1993) Gene 135, 167-173. 

11 (1993) Eur. J. Immunol. 23, 2860-2867. 

12 (1993) Eur. J. Immunol. 23, 2868-2875. 

16 (1993) Eur. J. Immunol. 23, 3248-3271. 

17 (1994) Biol. Chem. Hoppe-Seyler 375, 189-199. 
19 (1994) Genomics 22, 213-215. 
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Appendix E 


Genetic and Immunological Properties of Phage-Displayed Human Anti-Rh(D) 
Antibodies: Implications for Rh(D) Epitope Topology 

By Tylis Y. Chang and Don L Siegel 


Understanding anti-Rh(D) antibodies on a molecular level 
would facilitate the genetic analysis of the human immune 
response to Rh(D) r lead to the design of therapeutically 
useful reagents that modulate antibody binding, and provide 
relevant information regarding the structural organization of 
Rh(D) epitopes. Previously, we described a Fab/phage display- 
based method for producing a large array of anti-Rh(D) 
antibodies from the peripheral blood lymphocytes of a single 
alloimmunized donor. In the current study, we present a 
detailed analysis of 83 randomly selected clones. Sequence 
analysis showed the presence of 28 unique 71 heavy chain 
and 41 unique light chain gene segments. These paired to 
produce 53 unique Fabs that had specificity for at least half of 
the major Rh(D) epitopes. Surprisingly, despite this diversity, 
only 4 closely related heavy chain germline genes were used 
(VH3-30, VH3-30.3, VH3-33, and VH3-21). Similarly, nearly all 
V K light chains (15/18) were derived from one germline gene 
(DPK9). X light chains showed a more diverse V L gene usage, 

CLINICALLY, THE HUMAN Rh(D) antigen is the most 
important red blood cell (RBC) membrane protein in 
transfusion medicine. The alloimmune response against Rh(D) 
produces high-affinity IgG antibodies that cause hemolytic 
transfusion reactions and hemolytic disease of the newborn 
(HDN). The prophylactic use of Rh(D)-immune globulin in 
pregnant Rh(D)-negative women has been a major advance in 
the prevention of HDN, 1 yet the mechanism by which the drug 
exerts its immune modulatory effect is not well understood. 2 
Monoclonal antibodies (MoAbs) derived from the B cells of 
Rh(D)-immune globulin donors have defined several dozen 
Rh(D) epitopes 3 ; paradoxically, the Rh(D) antigen, an approxi- 
mately 30-kD transmembrane protein, has minimal extracellu- 
lar mass and presents a very limited surface area for epitope 
expression. 4 * 9 The molecular cloning of large repertoires of 
anti-Rh(D) antibodies would help reconcile these observations. 
In addition, it would facilitate the rational development of 
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but all (23/23) used the identical J x 2 gene. Several Fabs that 
differed in epitope specificity used identical heavy chains but 
different light chains. In particular, 2 such clones differed by 
only 3 residues, which resulted in a change from epD2 to 
epD3 specificity. These results suggest a model in which 
footprints of anti-Rh(D) antibodies are essentially identical 
to one another, and Rh(D) epitopes, as classically defined by 
panels of Rh(D) variant cells, are not discrete entities. Further- 
more, these data imply that the epitope specificity of an 
anti-Rh(D) antibody can change during the course of somatic 
mutation. From a clinical perspective, this process, which we 
term epitope migration, has significance for the design of 
agents that modulate antibody production and for the cre- 
ation of mimetics that block antibody binding in the settings 
of transfusion reactions and hemolytic disease of the new- 
bom. 

© 1998 by The American Society of Hematology. 

recombinant formulations of Rh(D)-immune globulin and aid in 
the design of therapeutic agents that block antibody binding. 
Furthermore, the comprehensive genetic analysis of anti-Rh(D) 
antibodies within a given alloimmunized individual would 
serve as a paradigm for human immune repertoire development, 
an area in which limited information is currently available. 

Previously, no more than 8 IgG anti-Rh(D) human MoAbs 
have been derived from a single individual. 10 The primary 
challenge in studying the Rh(D) immune response has been 
technical difficulties inherent in human B-cell immortalization. 
Epstein-Barr virus (EBV) transformation results in relatively 
low transformation efficiencies 11 that can undergo a decline in 
antibody production, 12 " 15 whereas cell fusion methods have 
been hampered by the lack of good fusion partners. 16,17 More 
recently, molecular approaches have been developed that by- 
pass the need for cell transformation. 18 " 20 Conceptually, these 
techniques, referred to as repertoire cloning or Fab/phage 
display, seek to immortalize Ig mRNA rather than the B cells 
from which they were derived. In an earlier report, our 
laboratory adapted these techniques for isolating Fab/phage 
antibodies directed against conformation-dependent antigens 
expressed on cell surfaces. 21 Using intact human RBCs, we 
isolated highly diverse y y K and 71X Fab/phage libraries against 
the Rh(D) antigen from the B cells of a single Rh(D)-immune 
globulin donor. 22 

In the following report, we present a detailed genetic and 
serological analysis of 53 unique anti-Rh(D) antibodies derived 
from 83 randomly chosen clones. The results complement 
previous reports on the genetic and biochemical makeup of 
monoclonal anti-Rh(D) antibodies derived from multiple do- 
nors. 10 * 23 " 25 Significantly, our data also demonstrate extensive 
genetic homology between antibodies directed against different 
Rh(D) epitopes. We provide evidence that antibodies directed 
against different epitopes can be clonal ly related. Finally, we 
suggest a model that reconciles the serological diversity of 
anti-Rh(D) antibodies with the topological constraints imposed 
by the Rh(D) antigen. 
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MATERIALS AND METHODS 

Production of Monoclonal Anti-Rh(D) Phage-Displayed 
and Soluble Fab Molecules 

Methods for the isolation of human anti-Rh(D)-specific antibodies 
from 71K and -yjX Fab/phage display libraries using the pComb3H 
phagemid vector and a cell-surface panning protocol have been 
previously published. 22 Soluble anti-Rh(D) Fab preparations for inhibi- 
tion studies were produced from bacterial cultures transfected with 
plasmid DNA from which the Ml 3 gene III coat protein sequence had 
been excised as described. 21 ^ 6 Cultures were grown by shaking at 300 
RPM at 37°C in superbroth (30 g/L tryptone, 20 g/L yeast, 10 g/L 
MOPS, pH 7.00) containing 20 mmol/L MgCl 2 and 50 ug/mL carbeni- 
cillin to an OD 600 of 0.5. Isopropyl-(3-D-thiogalactopyranoside (IPTG) 
was added to 1 mmol/L and cultures were shaken overnight at 30°C. 
Bacterial pellets were harvested and resuspended in l/50th of the initial 
culture volume with osmotic shock buffer (500 mmol/L sucrose, 1 
mmol/L EDTA, 100 mmol/L Tris, pH 8.00), incubated for 30 minutes at 
4°C, and centrifuged at 16,000g for 15 minutes at 4°C. Fab-containing 
supernatants were dialyzed against phosphate-buffered saline (PBS) and 
used in agglutination experiments without further purification. 

Anti-Rh(D) Antibody Binding Assays 

The binding of anti-Rh(D) Fab/phage or soluble Fab molecules to 
normal or partial Rh(D) antigens was assessed by indirect agglutination 
assays as described. 21 22 Briefly, 100 uL aliquots of phage-di splayed 
Fabs or soluble Fabs were incubated with 50 uL of a 3% suspension of 
RBCs. After 1 hour of incubation at 37°C, the RBCs were washed three 
times with 2 mL of cold PBS to remove unbound antibody. The 
resulting RBC pellets were resuspended in 100 uL of a 10 ug/mL 
solution of sheep anti-M!3 antibody (5 Prime — 3 Prime, Boulder, CO) 
for Fab/phage experiments or goat antihuman k or X light chain 
antibody (Tago, Burlingame, CA) for 71 k or 7,\ soluble Fab experi- 
ments, respectively. The RBC suspensions were transferred to the 
round-bottomed wells of a 96-well microplate and left undisturbed for 2 
hours. Negative reactions show sharp approximately 2-mm diameter 
RBC spots, whereas the RBCs in agglutinated wells form a thin carpet 
coating the entire floor of the well. 22 Agglutination titers for recombi- 
nant antibodies were determined by performing serial twofold dilutions 
in 1% bovine serum albumin (BSA)/PBS. Typically, Fab/phage had 
agglutination titers of 1/1,024 to 1/2,048 (where neat is defined as 5 X 
10 12 tfu/mL), 22 and soluble Fabs had agglutination titers of 1/64 to 
1/128 when prepared as described above. 

For determining Rh(D) epitope specificity for anti-Rh(D) Fab/phage 
antibodies, the following reference Rh(D) variant cells were obtained 
from the MRC Blood Group Unit (London, UK), The New York Blood 
Center (New York, NY), or Gamma Biologicals, Inc (Houston, TX): 
0/D IIIa Cce, G positive; B/D IIIc Cce; A/D^cc; A/D^ce; O/D^e; 
O/D^Cce; B/D^Cce, Go a negative, Rh32 negative; O/D^Cce; 0/ 
D Va cEe, D w positive; 0/D VI Cce; B/D VI Cce; AB/D^Cce; A/D vr cEe; 
O/D^Cce; and O/D^Cce. Each Fab/phage antibody was tested on at 
least three separate occasions against at least two different examples of 
each variant cell type, and identical epitope assignments were obtained 
each time. For antibodies that demonstrated previously undescribed 
patterns of reactivity or repeatedly weak reactivity against one type of 
cell (see the Results), monoclonal Fab/phage were prepared on a least 
four separate occasions to verify the patterns of reactivity. 

For inhibition studies, the ability of antibodies with different Rh(D) 
epitope specificities to compete with each other for binding was 
assessed by preparing stocks of each clone in both a soluble Fab form 
and a phage-displayed form. Pairwise combinations of soluble Fabs and 
Fab/phage were prepared and added to Rh(D)-positive RBCs. The 
resulting incubation mixes comprised 50 uL of a 3% suspension of 


RBCs, 100 uL of undiluted soluble Fab, and 100 uL of Fab/phage 
diluted to its highest agglutinating titer. After 1 hour of incubation at 
37°C, RBCs were washed, resuspended in anti-M13 antibody, and 
placed in microplate wells as described above. That the amount of 
soluble Fab present in an incubation mixture was sufficient to compete 
away a Fab/phage that shared the same binding site was determined by 
verifying that each soluble Fab preparation could block its own 
Fab/phage (see the Results). 

Inhibition experiments were also performed using pairwise combina- 
tions of soluble Fabs instead of soluble Fab and Fab/phage combina- 
tions. In this type of experiment, pairs of soluble Fabs specific for 
different epitopes were chosen such that one Fab contained a X light 
chain and the other a k light chain. Incubations with RBCs were 
performed with one Fab in excess and the other in limiting amounts. 
Blocking of the latter antibody was assessed using a secondary antibody 
(anti-X or anti-ic) specific for its light chain isotype. 

Nucleotide Sequencing and Analysis 

Plasmid DNA for sequencing was prepared using the Qiawell system 
(Qiagen, Chatsworth, CA). Double-stranded DNA was sequenced using 
light chain or heavy chain Ig constant region reverse primers or a set of 
unique pComb3H vector primers that anneal 5' to the respective Ig 
chain 26 ' 27 and automated fluorescence sequencing (Applied Biosystems, 
Foster City, CA; DNA Sequencing Facility, University of Pennsylvania 
Department of Genetics and Cancer Center, Philadelphia, PA). Se- 
quence analysis and variable region germlinc assignments were per- 
formed using DNAplot 28 and the V Base Directory of Human V Gene 
Sequences (March 1 997 update). 29 Germline assignments were corrobo- 
rated with the Mac Vector (v. 6.0) software package (Oxford Molecular 
Group, Oxford, UK) against the same database. Multiple sequence 
alignments and predictions of isoelectric point were calculated using the 
Pileup and Isoelectric programs of the GCG software package (v. 8.0. 1 ; 
GCG, Madison, WI). Statistical analysis was performed with Statview 
(Abacus Concepts, Berkeley, CA). 

Because of the large number of heavy and light chain sequences (N = 
69), only alignments of the predicted amino acid sequences are 
presented. Nucleotide sequences of all clones are available in Genbank. 

RESULTS 

Sequence Analysis ofAnti-Rh(D) Heavy and Light Chains 

We previously reported on the use of Fab/phage display and 
cell-surface panning to isolate a large array of anti-Rh(D) 
antibodies from the peripheral blood lymphocytes of a single 
hyperimmunized donor. 22,30 Separate 71 k and 7|\ Fab/phage 
display libraries had been constructed and contained 7 X 10 7 
and 3 X 1 0 8 independent transformants, respectively, based on 
electroporation efficiency. Each library was panned indepen- 
dently using a simultaneous positive/negative selection strategy 
with magnetically labeled Rh(D)-positive RBCs and unmodi- 
fied Rh(D)-neganve RBCs as described. After two rounds of 
panning, 32 of 36 7^ and 1 5 of 1 5 71 k randomly chosen clones 
were positive for anti-Rh(D) activity. After the third round of 
panning, 24 of 24 7^ and 12 of 12 71 k clones were positive. 
Nucleotide sequencing of the 83 positive clones showed a total 
of 28 unique heavy and 41 unique light chains. Because of 
combinatorial effects during phage display library construction, 
heavy and light chain gene segments paired to produce 53 
unique Fab antibodies. 22 

Anti-Rh(D) heavy chains. All of the heavy chain sequences 
used V H III family-encoded gene products (Figs 1 and 2). 
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Fig 1. (A) Dendrogram and 
(B) CDR3 alignment of anti-Rh(D) 
heavy chains. The 28 unique 
heavy chain clones are orga- 
nized by V H family, V H germline 
gene, and VDJ rearrangement. 
Each heavy chain clone is identi- 
fied by a numeral preceded by a 
letter (B through E) that denotes 
its germline gene. The 28 heavy 
chains comprised 12 distinct 
VDJ regions, designated VDJ1 
through VDJ 12. Clones with 
identical VDJ joins putatively re- 
sult from irttraclonal diversity of 
12 original B lymphocytes. 


Several sequences shared identical VDJ joining regions, and 1 2 
unique VDJ rearrangements were identified and designated 
VDJ1 through VDJ 1 2. Alignment of these sequences against the 
V Base Directory of Human V Gene Sequences 29 showed that 
only four V H III genes were used by these antibodies: VH3-21, 
VH 3-30, VH 3-33, and VH 3-30.3. VH3-21 was used by 1 of 
the 12 VDJs and 2 of the 28 clones; VH3-30 by 1 VDJ and 6 
clones; VH3-33 by 9 VDJs and 19 clones; and VH3-30.3 by 1 
VDJ and 1 clone. Interestingly, VH3-30, VH3-33, and VH3- 
30.3 comprise a set of closely related genes (>98% homology; 
Fig 2B) and their next nearest neighbor, VH3-07, is only 90% 
homologous (Fig 2C). Hereafter, these three genes are referred 
to as the VH3-33 superspecies. Heavy chain El differed from 
VH3-21 by 6 mutations and from VH3-48 by 10 mutations; 
hence, it was assigned to the former germline gene. Because 
there were no common mutations among the VH3-33 clones, it 
is highly probable that the donor possessed the VH3-33 
germline gene. However, we could not formally rule out gene 
duplication with allelic variants of VH3-33 or the existence of 
variant alleles of the other germline genes in the donor. The 
isolation of clones sharing multiple VDJ joining regions argues 
that cloning artifacts cannot account for the V H gene restrictions 
observed. 

Neither J H nor D segments showed restriction. At least 9 
different D segments were used and J H gene use comprised J H 6 
(5 VDJs and 9 clones), J H 4 (4 VDJs and 10 clones), J H 3 (2 VDJs 
and 8 clones), and J H 5 (1 VDJ and 1 clone). All four V H genes 
were Chothia class 1-3, 31 and the CDR3s showed a narrow 
range of length from 1 5 to 19 residues. 


Because rearranged heavy chain genes demonstrate extensive 
diversity, clones sharing identical VDJ rearrangements are 
generally considered to have arisen from the same clone. Based 
on nucleotide alignment with the germline genes (data not 
shown), an ontogeny tree was constructed for the 12 VDJs and 
28 clones (Fig 3). By using the most parsimonious mutation 
scheme (ie, postulating the minimum number of mutations), 
putative intermediate antibodies were derived for several of the 
VDJs and were designated Ca, Cb, Da, Db, and Dc (Figs 2 A and 
3). Compared with the isolated heavy chain clones, which 
displayed between 6 and 23 nucleotide differences from their 
germline counterparts, these putative intermediates had be- 
tween 3 and 12 mutations from germline. Based on the 
ontogeny tree, the number of independent mutations could be 
tabulated among the clones. The most commonly mutated 
residues were 52a and 58 (7 independent mutations), followed 
by residues 30, 31, and 50 (6 mutations) and residue 55 (5 
mutations). In the VH3-33 superspecies, residues 52a and 58 in 
CDR2 are tyrosines and residue 52a was mutated to phenylala- 
nine in 6 of the 1 1 VDJs derived from VH3-33 superspecies V H 
genes. Mutations at residue 58 comprised glutamate (3), 
aspartate (2), histidine (1 ), and asparagine (1). The AGY serines 
at residues 30, 3 1 , and 55 were mutated to a number of different 
amino acids, although the AGY serine at 82b was conserved in 
all clones. The valine at residue 50 in the VH3-33 superspecies 
also had a diverse set of mutations. This distribution of hot spots 
is similar to that seen with nonproductive rearrangements as 
previously reported by Dorner et al. 32 
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Fig 2. (A) Alignment of anti-Rh(D) heavy chains to their nearest 
germline V, D, and J genes. Also shown are the putative intermediate 
heavy chain sequences (Ca, Cb, Da, Db, and Dc; see text and Fig 3). The 
number of nucleotide differences from a germline V H is tabulated to the 
right of each sequence. In general, D segments showed poor homology 
with known D genes, so mutations were not scored in these regions. 
Key: Replacement mutations indicated with letters, silent mutations as 
"*", identities as and insertions as Sequences derived from the 
5' V H primers used in library construction 22 are marked as ">". CDR 
region designations are determined as per Kabat 59 ; numbering and H 
region designations per Chothia et al. 31 (B) Alignment of the four VH3 
genes used by anti-Rh(D) heavy chains and (C) dendrogram of all 
human VH3 family germline genes shows relatedness of VH3-21, 
VH3-30, VH3-33, and VH3-30.3 and the surprising restriction in V H gene 
usage. Note that the VH3-30.5 gene is present in only certain haplo- 
types and is identical to VH3-30. 60 Genbank accession numbers for 
anti-Rh(D) heavy chains are listed in the Appendix. 


Anti-Rh(D) light chains. Seventeen of the 1 8 k light chains 
were from the V K I family and the remaining light chain 
originated from a V K II family member germline gene (Fig 4). 
Only 4 V K germline genes were used (15 clones were derived 
from DPK9 alone), and the k light chain clones had between 1 


and 49 mutations from their corresponding V K germline genes. 
All 5 of the known J K genes were used and were each joined to 
the DPK9 gene in one or more clones. Because the light chains 
showed considerably less diversity in their joining regions than 
the heavy chains, it was difficult to assign common clonal 
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Fig 3. Ontogenic tree of anti-Rh(D) heavy chains constructed 
using nucleotide alignment data. Circles represent isolated and 
sequenced clones and diamonds represent putative intermediates 
(see Fig 2A). The number of nucleotide mutations from its germline 
V H gene is shown in parentheses below the clone name. The distance 
along the horizontal axis represents the degree of mutation (includ- 
ing J segments) within the constraints of the diagram. 

origins. However, an ontogeny tree was constructed by group- 
ing common V and J gene segments along with common 
mutations (data not shown). Based on this analysis, the 18 k 
chains comprised at least 10 different recombination events. 

X light chains were restricted by their J x gene usage but 
showed no restriction in their use of V x genes (Fig 5). The 23 X 
light chains all used the J x 2Vasicek gene but were derived from 
V X I (12 clones), V X III (5), V X VII (3), V X II (2), and V X IV (1) 
family genes. The number of mutations ranged from 2 to 41 
from the nearest germline V x gene. Based on common joining 
regions and mutations, these 23 X light chains were derived 
from at least 1 3 different B cells. 

Assessment of the Diversity of the Unpenned Libraries 

To determine whether the apparent restriction in gene usage 
of the anti-Rh(D) antibodies could have been due to preselec- 
tion factors (ie, cloning artifacts), we assessed the diversity of 
the unpanned y\K and 71X Fab/phage libraries. By sequencing 
39 randomly picked clones, we determined that there were no 
duplicate heavy or light chain sequences and that there was 
significant heterogeneity in V gene family representation before 
selection (Fig 6). In fact, the variable region gene family 
distribution was not unlike that found by other investigators for 


IgG-sccreting lymphocytes in adult peripheral blood. 33 Further- 
more, of the 14 V H III-encoded negative clones, only one used a 
VH3-33 superspecies germline gene (VH3-30.3); the other 13 
were encoded by VH3-07 (3), 3-09 (2), 3-15 (2), 3-48 (2), 3-72 
(2), 3-23 (1), and DP-58 (1). Therefore, the restriction of the 83 
anti-Rh(D) clones to the VH3-33, 3-30, 3-30.3, and 3-21 genes 
is significant and not a result of skewed representation of certain 
germline genes within the originally constructed 71 k and 7,X 
Fab/phage libraries. 

Heavy and Light Chain Contribution to Rh(D) 
Epitope Specificity 

Because of the conformational dependency of Rh(D) antige- 
nicity, Rh(D) epitopes have been classically defined through the 
use of RBCs obtained from rare individuals whose cells appear 
to produce Rh(D) antigens lacking certain epitopes. 34 Examin- 
ing the pattern of agglutination of a particular anti-Rh(D) MoAb 
with such sets of partial Rh(D) RBCs enables one to categorize 
that antibody's fine specificity. 

Monoclonal Fab/phage preparations were prepared in tripli- 
cate for each of the 53 anti-Rh(D) clones and tested against a 
panel of Rh(D) category cells Illa/c, IVa, IVb, Va, VI, and VII. 
This panel of cells can differentiate between the Rh(D) epitope 
specificities as described by Lomas et al 6 (designated epitopes 
epDl, epD2, epD3, cpD4, epD5, and epD6/7). Agglutination 
experiments with the Fab/phage clones showed five different 
patterns of reactivity, including a new pattern that had not been 
described in the original study by Lomas et al 6 or in the more 
recently described 9, 30, or 37 epitope systems (Figs 7 and 

3,35 Although nearly all Fab/phage gave unequivocal aggluti- 
nation reactions, a few antibodies gave repeatedly weak patterns 
of reactivity against one of the panel cells. For these reactions, 
monoclonal Fab/phage were prepared on at least four separate 
occasions to verify the patterns of reactivity. 

The most commonly recognized epitope was epD6/7, against 
which 13 clones were directed. Interestingly, monoclonal 
anti-Rh(D) clones isolated using conventional tissue culture 
methods are most often specific for CPD6/7. 34 epD2, epDl, and 
epD3 were recognized by 10, 7, and 2 clones, respectively. Six 
clones agglutinated cells of categories Illa/c, IVa, and VII, but 
not of categories IVb, Va, and VI, and were designated 
anti-epDX. This pattern is identical to epDl, except that the IVa 
cell is agglutinated. Three clones gave intermediate reactions 
with cell IVa, but otherwise showed patterns consistent with 
epDX or epDl . These clones were designated epDX 1 or epDl x , 
depending on whether this reactivity against cell IVa was 
stronger or weaker, respectively (Fig 8). Similarly, reaction 
patterns for epD 1 and epD2 differ by a positive reaction with the 
category Va cell; therefore, one clone was given epD2' specific- 
ity because it gave only moderate reactivity against that cell. 
Such variable reactions against one or more partial Rh(D) cells 
have been observed for anti-Rh(D) MoAbs produced through 
conventional tissue culture methods. 36 

Because of the rcassortment of heavy and light chain gene 
segments that occurs during the construction of a phage display 
library, a number of clones were isolated that shared either a 
heavy (eg, El) or light (eg, M3) chain sequence (Fig 8). Some 
heavy chains were found to have paired with both k and X light 
chains (eg, CI, D20), and each demonstrated anti-Rh(D) 


PH AGE-DISPLAYED HUMAN ANTI-Rh(D) ANTIBODIES 


3071 


DPK9 
I0S 
104 
IIS 

xoa 

116 


112 
Z10 
113 

io e 

109 

111 

DPK9 
101 
103 


DPK9 
I0« 


JK 

JK1 


JK4 
JK5 
JK3 
JK1 
JK4 


FR1 


LI 

CDR1 


L3 
CDR3 


* nucleotide 
differences from 
gernline vx 


1 2 3 4 5 6 7 8 9 10 

12345678901234567890123 4567C901abcdef234 567890123456789 0123456 78901234567890123456789012345678 9012345a67 B9012345678 

DIQK1XJSPSSLSASVCDRVTITC RASQS1SS -YLN WYQQKPGKAPKLLI Y AASSLQS GVPSRFSGSGSCTOFTLTISSLQPEDFATVYC QQSYSTP+WT FCQGTKVEIK 

»»»» RR H F T --Q 

»»»» N. RR 

»»»» N.N.RR 

»»»>> • 

»»»»*. .P. .* T.C 


DIOMTOSPSSLSASVGDRVTITC RASQSISS- 

»»»». . * — 

»»»» N. . ,-■ 

»»>»>. . . . • R— 

»»»»F * * T. .R-- 

»»>»> -• 

»»»» — 


DIQKTQSPSSLSASVGroVTrPC RASQSISS 

»»»» ..*...*. 

»»»» A T.RN.NR 


DIQHTQSPSSLSASVGDRVTITC RASQSISS— 

»»»>> — ■ 


DIQMTQSPSSLSASVGEKVnTC RASQSISS- 

»»»» • -* 


»»»» . 


DIQKTOSPSSLSASVCDRVTrrC RASQCIRN- 

»»»» F. .-■ 





-YLN WYQQKPGKAPKLLIY AASSLQS GVPSRFSGSGi 

3GTDFTLTISSLQPEDFATYYC QQSYSTP+YT FGQGTKLEIK 








-YLN WYQQKPGKAPKLLIY AASSLQS GVPSRFSGSC; 

5GTDFTLTISS LQPEDFATYYC QQSYSTP+FT PGPGTKVDIK 




—YLN WxQQKFGKAPKLLIi AAi&Uj> uvr bKt 1*>W. 



SGTDPTLTT SS LQPEDFATYYC QQSYSTP»IT PGQGTRLEIK 

— YLN WYQQKPGKAPKLLIY AASSLQS GVPSRFSGSC! 


— YLA WYQQKPGKAPKLLIY AASTLQS GVPSRFSGSGSGTEPTLTI SS LQPEDFATYYC QQLNSYP*FT PGPGTKVDIK 

•-DLG WQQKPGKAPKRLIY AASSLQS GVPSRFSGSG! 

SGTEPTLTI SSLQPEDFATYYC LQHNSYP+WT PGQGTKVEIK 


23 
11 
5 


4 

13 


DIVKIQSPLSLPVTPCEPASISC RSSQSLLHSNGYN-YLD WYLQKPGQSPQLLIY LGSNRAS GVPDRFSGSGSGTDPTLK1SRVEAEDVGVYYC MQALQTP+LT FGGGTKVETK 


B 

VK 

OHHS FAN. 

DPK9 I DIQWTQSPSS LSASVGDRVTITC RASQSISS YLN WYQQKPGKAPKLLIY AASSLQS GVPSRFSGSGSGTDFTLTI SSLQPEDFATYYC QQSYSTP 

DPKB I ...L F O. .. ..A T B LN.Y. 

A30 1 CRN D.G R E L.HN.Y. 

DPK1S II .-V L..PVTP.EPAS.S. -S. . . LLHSNGYN-. . D . .L....Q5.Q. ... LG.NRA. . . .D K. .RVEA. .VCV. . . M.ALQ. . 

Fig 4. (A) Alignment of anti-Rh(D) k light chains to their nearest germline V and J genes shows predominance of DPK-9 usage from the VJ 
family. Nomenclature for clones is similar to that for heavy chains but uses the letters F through I. (B) Alignment of the four V„ genes used by 
anti-Rh(D) light chains. The key is the same as that used in Fig 2A. Genbank accession numbers for ant*-Rh(D) k light chains are listed in the Appendix. 


specificity. Interestingly, some heavy chains (eg, El, D12) 
mapped to different epitopes depending on the light chains with 
which they were paired. In particular, the light chains of two 
such clones, E1/M2 and E1/M3, differed by only 3 amino acid 
residues (Fig 5) and these differences appear to confer specific- 
ity for epD2 versus epD3. 

Inhibition Studies 

To investigate the topological relationships among the Rh(D) 
epitopes, inhibition studies were performed. Previous work by 
Gorick et al 37 using pairs of unlabeled and ,25 I-labeled anti- 
Rh(D) MoAbs demonstrated that antibodies to at least 3 
different Rh(D) epitopes (subsequently identified as epDl, D6, 
and D7) 6 could inhibit one another. We have confirmed and 
extended these findings using recombinant antibodies to 5 
Rh(D) epitopes (Fig 9). In one series of experiments, we 
exploited the ability to express each antibody in both a soluble 
Fab as well as phage-displayed form to ask whether a soluble 
Fab against one epitope would inhibit the agglutination induced 
by an Fab/phage directed against a different epitope. Reciprocal 
pairs of soluble Fab and Fab/phage specific for epDl, epD2, 
epD3, epD6/7, and epDX were tested. All 10 combinations 
showed mutual inhibition patterns (shown in Fig 9A for an 
anti-epD3/anti-epD6/7 combination). To show that this inhibi- 
tion was not due to nonspecific factors, a control with an 
irrelevant RBC-binding recombinant antibody (an anti -blood 
group B antibody) was performed (Fig 9B). That sufficient 
inhibitory amounts of soluble Fab was present were first 


verified by demonstrating that each soluble Fab could inhibit its 
own Fab/phage (Fig 9 A and B; samples on diagonal). Similar 
results were obtained using pairs of soluble Fabs which differed 
in their light chain isotype composition (Fig 9C). 

Isoelectric Point (pi) Analysis of Anti-Rh(D) Antibodies 

The restriction in V H germline gene usage to only four V H III 
family members was intriguing in light of their ability to confer 
specificity to a number of Rh(D) epitopes. As suggested by 
Boucher et al, 10 V H germline gene segments used to encode 
anti-Rh(D) antibodies are among the most cationic segments 
available in the human V H repertoire that may be used to 
account for the relatively high pi of polyclonal anti-Rh(D)- 
containing antisera. 38 ' 39 Although the cationic nature of the 
antibodies may be important for binding to Rh(D), it has also 
been suggested that a constitutive net positive charge may be 
necessary to permeate the highly negative RBC 5 potential, thus 
permitting antibody to contact antigen. 34 In either case, analysis 
of the predicted pi for the 28 heavy chains and 41 light chains 
isolated here showed an interesting phenomenon for the heavy 
versus light chains. Using the pi interval scale of Boucher et 
al, 10 the average pi of the 4 germline V H segments used to 
encode the 28 heavy chains is high (9.87 ± 0.15) and 
significantly higher than that of 39 randomly picked, non-Rh(D) 
binding clones from the original unpanned libraries (9.24 ± 
0.80, P < 10" 5 ). Similar to the results of Boucher et al, 10 the 
addition of D and J H segments and the introduction of somatic 
mutation did not significantly change the pi of the average 
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Fig 5. (A) Alignment of anti-Rh(D) X light chains to their nearest germline V and J genes and (B) alignment of the 10 V x germline genes used 
shows the use of a diverse set of variable region genes derived from multiple families. However, all of the clones use the identical J x gene 
segment. Nomenclature for the clones is similar to that for heavy chains but uses the letters J through S. The key is the same as that used in Fig 
2A. Genbank accession numbers for anti-Rh(D) X light chains are listed in the Appendix. 


anti-Rh(D) heavy chain (9.8 1 ± 0.33, P < .37). However, for 
the light chains, the average pi of their germline counterparts 
was not cationic, but the light chains became so through the 
addition of J L segments and somatic mutation. Overall, for all 


1 8 k and 23 X light chains, paired /-test analyses before and after 
somatic mutation showed a significant increase in net positive 
charge when comparing germline V L (6.63 ± 1.47) with 
expressed V L (7.28 ± l.5l,/> < 1 0" 3 ) or germline V L J L (7.43 ± 
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Fig 6. Comparison of variable region gene family usage for anti-Rh(D)-specific clones with those used by randomly picked, non-Rh(D)-binding 
clones from original 7i k and 7l X unselected libraries. Lightly hatched bars reveal heterogeneity in V H (left panel), V„ (middle panel), and V x (right 
panel) family representation before selection for anti-Rh(D) specificity. Numbers above bars represent absolute number of clones in that group. 


PHAGE-DISPLAYED HUMAN ANTI-Rh(D) ANTIBODIES 

CLONE Rh(D)VARIANT ASSIGNED 
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Fig 7. Determination of the Rh(D) binding epitope of anti-Rh(D) 
Fab/phage clones. The fine specificities of monoclonal Fab/phage 
clones were determined by their ability to agglutinate members of a 
panel of six Rh(D) variant RBCs. Shown are the five different aggluti- 
nation patterns obtained from screening all of the 53 Fab/phage 
clones. The particular clones shown are identified by their unique 
heavy chain/light chain pairings using the nomenclature defined in 
Figs 1, 4, and 5. For E1/M3, reactivity with additional Rh(D) variant 
cells would be required to distinguish its specificity for epD3 versus 
epD9. 3 Rh(D) epitope assignments are as per Lomas et al. G Note that 
inclusion of the category IVb cell (not available in our previous 
study) 22 permits the identification of a new epitope designated epDX 
(see text). 

1 .47) with expressed V L J L (8.55 ± 1 .35, P < 10" 7 ). There was 
no significant increase in a similar analysis of 16 non-Rh(D) 
binding clones (P < .59 and P < . 1 9, respectively). Examina- 
tion of the light chain sequences (Figs 4 and 5) showed that this 
increase in pi resulted from mutations that not only introduced 
positively charged residues, but also eliminated some nega- 
tively charged residues. There were 3 1 such events, 29 (91%) of 
which occurred in the light chain CDR regions. 

DISCUSSION 

Conventional and Phage-Displayed Anti-Rh(D) MoAbs 

Because of differences in methodology, we were interested in 
comparing our phage-display-derived anti-Rh(D) clones with 
those produced by conventional tissue culture techniques (EBV 
transformation and cell fusion). Despite the relatively small 
number of previously published sequences for IgG anti-Rh(D) 
antibodies (N = 21) and the fact that they were derived from 
over 10 different donors, 10 ' 23 " 25 there was surprisingly good 
correlation between the two groups (Table 1). Both cohorts 
demonstrated a predominance of V H HI-family encoded germ- 
line genes, particularly those of the VH3-33 superspecies. 
CDR3 regions showed similar lengths, ranging from 1 5 to 19 
residues for Fab/phage antibodies and 16 to 20 for conventional 
monoclonals, although one heterohybridoma was an outlier 
with a CDR3 length of 28 residues, k light chains were biased 
towards V K 1 family members and \ light chains demonstrated 
the preferential use of the J x 2Vasicek gene. The only qualitative 
discrepancy was in V x family usage, where Fab/phage clones 
demonstrated a slight preference for V X I versus V X III family 
members for conventional monoclonals. However, in both 
cohorts, DPL16 was used more often than any other \ light 
chain gene. 

It has been suggested in the literature that the VH4-34 
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(VH4.21) germline gene, a gene used by many autoantibodies 
and cold agglutinins, 40-42 may play an important role in the 
immune response to Rh(D). 43 However, these conclusions arose 
from the analysis of IgM monoclonals and only 2 of the 21 
published anti-Rh(D) IgG sequences used VH4-34. 25 In a 
related series of experiments, we pooled aliquots of the 7jk and 
7i\ libraries obtained after the second and third rounds of 
selection and then panned them against the VH4-34 specific rat 
anti-idiotypic MoAb (9G4 44 ). Although we successfully en- 
riched for VH4-34 encoded antibodies, the Fab/phage were not 
specific for Rh(D) and displayed serological characteristics 
similar to those of cold agglutinins (data not shown). We are 
currently examining a n phage display library from the same 
donor to compare gene usage. 

Rh(D) Epitopes and Significance of Antibody Sequences 

Since the initial report by Argall et al 45 in 1953, it has been 
recognized that rare individuals who type as Rh(D)-positive can 
produce allo-anti-Rh(D) antibodies in response to Rh(D) immu- 
nization by transfusion or pregnancy. This phenomenon was 
explained by hypothesizing that the Rh(D) antigen is a mosaic 
structure and that these individuals were producing alloantibod- 
ies to parts of the mosaic they lack. By systematically examin- 
ing patterns of reactivity between their cells and sera, RBCs 
expressing partial Rh(D) antigens were divided into categories, 
each presumed to have a different abnormality in their Rh(D) 
antigen. Through the subsequent use of index panels of 
monoclonal anti-Rh(D) antibodies, a series of epitopes were 
defined of which the number and combination varied from one 
Rh(D) category to another. As new monoclonals were produced, 
their reactivity profiles against these partial Rh(D) RBCs 
became the standard method for determining Rh(D) antibody 
epitope specificity. Molecular analyses of partial Rh(D) pheno- 
types have shown that the Rh(D) genes in these individuals have 
either undergone intergenic recombination with the highly 
homologous Rh(CE) gene or, less commonly, have sustained 
point mutation(s). 46 

As noted earlier, to investigate the topological relationships 
among Rh(D) epitopes, Gorick et al 37 performed competition 
experiments with Rh(D) MoAbs and observed varying degrees 
of inhibition. These results, when combined with those of 
Lomas et al, 6 suggested a model for Rh(D) in which epitopes are 
spatially distinct yet demonstrate a certain degree of overlap, as 
shown in Fig 1 OA. This model explained how antibodies to two 
different Rh(D) epitopes (in this case epD2 and epD3) could 
inhibit each other's binding to wild-type Rh(D) and how a 
change in the structure of Rh(D) in category VI RBCs (asterisk, 
Fig 1 OA) would cause the loss of epD2. However, based on this 
concept of Rh(D) epitopes as distinct domains, we would expect 
that antibodies against different epitopes of Rh(D) would be 
structurally and genetically distinct as well. Thus, it was 
surprising that our anti-Rh(D) clones demonstrated such marked 
restriction in gene usage. For example, only two superspecies of 
V H genes were used despite specificities for 4 of the original 6 
Rh(D) epitopes described by Lomas et al. 6 Furthermore, 
multiple specificities could arise from a single heavy chain 
depending on the light chain with which it was paired (eg, E 1 
with M2, M3, L3, or L4). In addition, other clones repeatedly 
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Fig 8. Matrix illustrating the 
genetic composition and epi- 
tope specificity of anti-Rh(D) an- 
tibodies. The horizontal axis rep- 
resents the unique 71 heavy 
chains and the vertical axis repre- 
sents the unique X and k light 
chains (based on amino acid se- 
quence). A shaded pattern at the 
intersection of a heavy chain/ 
light chain pair indicates the 
Rh(D) epitope specificity ob- 
served for that Fab/phage anti- 
body. A few clones gave mixed 
patterns of reactivity, as shown 
(see text). The order of heavy 
chains (left to right) and light 
chains (top to bottom) was deter- 
mined by the multiple alignment 
of amino acid sequences as in 
Figs 2, 4, and 5. Note that heavy 
chains D1, D15. D16, and D17, 
although differing in nucleotide 
sequence, have the identical 
amino acid sequences and thus 
comprise a single column. Simi- 
larly, heavy chains C5 and C8 and 
X light chains K1 and K2 encode 
the same proteins. The pairings 
of these 28 heavy and 41 light 
chain nucleotide gene segments, 
which produced 53 unique Fab 
transcripts, encoded 43 different 
Fab proteins, as indicated in the 
matrix. 


demonstrated variable weak reactivity against certain Rh(D) 
category RBCs that would affect the epitope specificities to 
which they were assigned (eg, CI with Ol , Ml , or J5). 

Several hypotheses could account for these findings. The 
most simplistic interpretation is that the heavy chain does not 
directly interact with the antigen, but rather is responsible for 
bringing the antibody in close proximity with the antigen. The 
specific interactions between the light chain and the antigen 
would then determine the epitope specificity for that antibody. 
In this regard, our data are consistent with the observations of 
Boucher et al 10 on the relative cationic nature of anti-Rh(D) 
heavy chains. However, because we found that light chains 
become cationic during somatic mutation, the charge of the 
entire antibody may play a role in its ability to bind, resulting in 
the selection and expansion of particular B-cell clones. 

A more compelling hypothesis is that Rh(D) epitopes do not 
differ spatially but differ only in the number and arrangement of 


contact residues presented (Fig 10B). In other words, the 
footprints of most, if not all, anti-Rh(D) antibodies are essen- 
tially identical to one another. The genetic events that produce 
partial Rh(D) molecules result in the loss of certain critical key 
points of contact necessary for some antibodies to bind; 
alternatively, they result in the formation of new structures that 
interfere with the binding of other anti-Rh(D) Igs. For example, 
the introduction of a ledge in Rh(D) category VI cells (asterisk, 
Fig 10B) does not interfere with the binding of an anti-epD3 
antibody, but does prevent the binding of anti-epD2. Therefore, 
category VI RBCs are said to have epD3 but lack epD2. 

This model is consistent with our inhibition experiments (Fig 
9) and with those of Gorick et al 37 and offers an explanation for 
the marked restriction in heavy chain gene usage. It also 
reconciles a mechanism by which one heavy chain (eg, El) can 
confer binding to multiple epitopes and why some of our 
recombinant anti-Rh(D) antibodies, as well as some convention- 
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Fig 9. Inhibition studies with recombinant anti-Rh(D) antibodies. Panels show results of representative experiments demonstrating the 
mutual inhibition of antibodies directed at 2 different Rh(D) epitopes On this example, epD3 and epD6/7; A and C), but not between an Rh(D) 
antibody and an unrelated recombinant anti-RBC antibody (an anti-blood group B antibody; B). In (A), Rh(D)-positive RBCs were incubated with 
soluble Fabs only, phage-displayed Fabs only, or combinations of the two, as indicated. In (B), Rh(D)-positive RBCs that were blood group B were 
used. After washing, RBCs were resuspended in anti-M13 antibody and assessed for agglutination induced by phage-displayed Fabs. Soluble 
Fabs were used full-strength, whereas Fab/phage preparations were present in limiting amounts to increase the sensitivity of the inhibition 
assay (see the Materials and Methods). In (C), mutual inhibition of epD3 and epD6/7 anti-Rh(D) antibodies was demonstrated with Rh(D)-positive 
RBCs, y,k and y y \ soluble Fabs, and light chain isotype-specific antisera (see text for details). In these examples, the anti-epD3 and anti-epD6/7 
antibodies were clones E1/M3 and D5/I3, respectively. The anti-blood group B antibody was isolated from an IgG phage display library made 
from the splenic B cells of a blood group O donor. 61 


ally produced monoclonals, 36 display variable reactivity against 
certain categories of partial Rh(D) RBCs. From the antigen's 
perspective, this model explains how a single point mutation in 
Rh(D) can result in the loss of multiple Rh(D) epitopes (such as 
T283I in category HMi RBCs 47 ) and how the residues associ- 
ated with the expression of some epitopes appear to be 
distributed among nearly all the extracellular loops of Rh(D). 48 
It also provides an understanding as to how >37 epitopes can fit 
on the relatively small extracellularly exposed surface of the 
Rh(D) molecule. 3 


This concept of coincident epitopes is best exemplified by 
comparing the E1/M2 and E1/M3 clones. The only difference 
between the reactivity of E1/M2 and El /M 3 is the ability of the 
latter antibody to agglutinate Rh(D) category VI cells (Fig 7). 
Hence, E1/M2 is classified as an anti-epD2 and E1/M3 as an 
anti-epD3 antibody. Light chains M2 and M3 differ by only 3 
residues: D82A, G95aA, and W96V (Fig 5). Therefore, some 
combination of these residues confers reactivity against cat- 
egory VI cells. In other words, epD2 and epD3, as seen by the 
E1/M2 and E1/M3 antibodies, differ by the binding constraints 


Table 1. Comparison of Current IgG Fab/Phage Library-Derived Anti-Rh(D) MoAbs With Those Previously Produced 

by Conventional Tissue Culture Methods 


Attribute 

Previously Published* 


Current Study 


Heavy Chains 


(by clone)t 


(by VDJ) 

VH3 family derived 

12/21 (57%) 

28/28 (100%) 


12/12(100%) 

VH3-33 superspecies*/VH3 

10/12(83%) 

26/28 (93%) 


11/12(92%) 

VH3-33/VH3 

9/12(75%) 

19/28 (68%) 


9/12(75%) 

VH3-21/VH3 

1/12(8%) 

2/28 (7%) 


1/12(8%) 

VH4-34 derived 

2/21 (10%) 

0/28 (0%) 


0/12(0%) 

JH6 usage 

15/21 (71%) 

9/28 (32%) 


5/12(42%) 

CDR3 length 

16-20 (28§) 


15-19 


k Light Chains 





VkI family derived/total k 

8/12(67%) 


17/18(94%) 


JkI usage/total k 

4/12(33%) 


6/18(33%) 


Jk2 usage/total k 

4/12(33%) 


6/18(33%) 


X Light Chains 





VX1 family derived/total X 

2/8 (25%) 


12/23 (52%) 


VX 3 family derived/total X 

5/8 (63%) 


5/23 (22%) 


DPL16 derived/VX3 family 

3/5 (60%) 


4/5 (80%) 


JA2 Vasicek usage/total X 

6/8 (75%) 


23/23 (100%) 



* Compiled from a total of 21 sequences of IgG anti-Rh(D) antibodies isolated from multiple subjects originally published by Bye et al, 25 
Hughes-Jones et al, 23 Chouchane et al, 24 and Boucher et al 10 and available from Genbank. One light chain (Oak-3) 25 was not available in Genbank 
and was not included in the assessment. 


tFor heavy chains, the left column tabulates each clone separately and the right column tabulates clones on the basis of shared V-D-J joining 
regions. 

tVH3-33 superspecies defined as the group of VH3 family germline genes comprising VH3-33. VH3-30, and VH30.3. 
§CDR3 length outlier 
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precursor anti-epD3 *~ anti-epD2 Rh(D)-negative Individual 

precursor anti-epD3 anti-epD2 Rh(D)-variant VI individual 


Fig 10. Conventional (A) and 
proposed (B) models for Rh(D) 
antigen/antibody binding. Note 
that the predicted combining 
sites and genetic relationships 
between antibodies differ be- 
tween the two models. (C) If 
antibodies directed at different 
Rh(D) epitopes are clonally re- 
lated, then the expressed reper- 
toire will differ between Rh(D)- 
negative and partial Rh(D) 
individuals (see text for discus- 
sion). 


imposed by at most three mutations. If the model depicted in Fig 
10A were correct and the epitopes were independent, these 
mutations would have to cause enough structural alteration in 
the antibody combining site so that a completely separate 
epitope on the same antigen would be recognized. It would 
seem unlikely that these 3 mutations could cause such a change, 
especially given the lack of internal homology domains in 
Rh(D). Thus, we conclude that it is far more plausible that the 
footprints of these 2 antibodies are essentially identical and that 
one or more of these mutations (eg, the tryptophan in CDR3 of 
M2) prevent(s) the interaction of E1/M2 with category VI 
RBCs. Because other clones demonstrate that the light chain 
can confer specificity against epDl , epD2, or epD3 (with the El 
heavy chain); epDl or epDX (with C5); and epDl, epD2, and 
epD6/7 (with D12), we suggest that all 5 of these epitopes have 
similar antibody combining sites. 

Immunologic and Clinical Implications of Proposed Model 

The model depicted in Fig 10B leads to additional predictions 
concerning the Rh(D) immune response beyond simply clarify- 
ing what is meant by an Rh(D) epitope. It is commonly stated in 
the transfusion medicine literature that individuals whose RBCs 
express partial Rh(D) antigens are free to make antibodies to the 
Rh(D) epitopes they lack. 34 Therefore, an individual who 
produces category VI RBCs should be able to make anti-epD2 
but not anti-epD3. If these epitopes were truly independent, 
then the immune repertoire of the anti-epD2 antibodies made by 
a category VI individual would be similar to those produced by 
an Rh(D)-negative person. However, to the immune system, 
epD2 and epD3 are not independent. We postulate that the 
somatic mutation of an anti-epD3 antibody can change its fine 
specificity to that of epD2 (or vice versa, Fig IOC). Suppose that 
the preferred way of making an anti-cpD2 antibody is to go 


through an anti-epD3 intermediate. To an Rh(D)-negative 
individual, this process can take place unimpeded. However, in 
a category VI individual, this route would be unfavorable 
because an anti-epD3 antibody would be self-reactive. As a 
result, such an individual would have to make anti-epD2 
antibodies by following alternative routes or by tolerating some 
degree of autoreactivity in the process. With respect to the latter 
point, it is of interest to note that a transient production of 
auto-anti-Rh(D) frequently precedes or accompanies the early 
production of allo-anti-Rh(D) in individuals who express partial 
Rh(D) antigens. 49 " 54 We would predict, therefore, that the 
anti-epD2 antibodies from a category VI individual would be 
different in composition (ie, gene usage) and quite possibly 
quantitatively depressed as compared with an Rh(D)-negative 
individual. This may be analogous to the antibodies of the ABO 
blood group system in which it has been observed that anti-A 
and anti-B titers in blood group O individuals are significantly 
higher than in blood group B or A individuals, respectively. 55 
Blood group O individuals are unconstrained in creating their 
anti-A and anti-B immune repertoires, whereas individuals who 
produce A or B antigens (2 nearly identical structures) must do 
so in a manner that avoids self-reactivity. 

In the case of antibodies E1/M2 and E1/M3, they appear 
to have arisen from a common precursor B cell rather than 
directly from each other (Fig 5). To test the framework of our 
hypothesis, ie, somatic mutation resulting in epitope migration 
of an antibody, we are constructing the precursors and potential 
intermediates between the M2 and M3 light chains and will then 
determine what Rh(D) epitope specificities (if any) they ex- 
press. This concept of epitope migration has been previously 
reported for murine anti-cryptococcal 56 and anti-type II colla- 
gen 57 antibodies. 

If the proposed model for Rh(D) epitopes is correct, then the 
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question of the number of epitopes may be obsolete. There may 
be as many epitopes as can be differentiated by the number of 
cell categories, ie, 2 n epitopes, where n is the number of distinct 
partial Rh(D) RBCs. A more important question is the interrela- 
tionships between the various epitopes. For example, are some 
epitopes further away than others — not in the topological sense, 
but in terms of the number of mutational hits an antibody needs 
to receive to change its serologic reactivity. Furthermore, does 
the humoral immune response in a partial Rh(D) individual 
differ from that in an Rh(D)-negative individual in the manner 
predicted by this model? One may find that allo-anti-Rh(D) 
antibodies made by partial Rh(D) individuals are not as 
clinically significant, ie, capable of inducing hemolysis. This 
may explain why hemolytic disease of the newborn due to 
anti-Rh(D) produced by pregnant individuals with partial Rh(D) 
phenotypes is so rare even when taking into account the low 
prevalence of the partial Rh(D) phenotypes. 34 A better under- 
standing of the immune response to Rh(D) in these patients may 
alleviate concerns regarding the need to identify such individu- 
als to ensure that they only receive Rh(D)-negative blood 
products for transfusion and Rh(D)-immune globulin during 
pregnancy. 58 Furthermore, with respect to the design of recom- 
binant Rh(D)-immune globulin for use in Rh(D)-negative 
patients, it may not be necessary to formulate cocktails of 
MoAbs containing multiple Rh(D) epitope specificities. 

In summary, we have studied the genetic and immunological 
properties of a large array of anti-Rh(D) antibodies to elucidate 
this clinically significant human immune response on a molecu- 
lar level. Our results show that anti-Rh(D) antibodies display a 
high degree of structural relatedness and the ability to inhibit 
each other's binding despite differences in epitope specificity. 
These findings suggest that Rh(D) epitopes are not spatially 
distinct and that Rh(D) antibodies may undergo epitope migra- 
tion as a result of somatic mutation. The end result is that the 
prevalence of certain anti-Rh(D) specificities in the immune 
repertoire may be a function not only of what epitopes an 
individual lacks, but of the number of accessible pathways that 
the individual's immune system can use that avoid self- 
reactivity. This process may be a general feature of human 
immune responses to other clinically significant, closely related 
epitopes. 
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APPENDIX 

Gcnbank accession numbers for anti-Rh(D) heavy chains are as 
follows: B01, AF044419; C01, AF044420; C03, AF044421; C04, 
AF044422; C05, AF044423; C08, AF044424; CIO, AF044425; D01, 
AF044426; D03; AF044427; D04, AF044428; D05, AF044429; D07, 
AF044430; D08, AF044431; D09, AF044432; D10, AF044433; Dll, 
AF044434; D12, AF044435; D13, AF044436; D14, AF044437; D15, 
AF044438; D16, AF044439; D17, AF044440; D18, AF044441; D20, 
AF044442; D30, AF044443; D31, AF044444; E01, AF044445; E03, 
AF044446. Gcnbank accession numbers for antiRh(D) k light chains 
arc as follows: F01, AF044447; G01, AF044448; HOI, AF044449; 101, 
AF044450; 102, AF044451; 103, AF044452; 104, AF044453; 105, 
AF044454; 106, AF044455; 107, AF044456; 108, AF044457; 109, 


AF044458; 110, AF044459; 111, AF044460; 112, AF044461; 113, 
AF044462; 115, AF044463; 116, AF044464. Genbank accession num- 
bers for anti-Rh(D) k light chains are as follows: J01, AF044465; J02, 
AF044466; J04, AF044467; J06, AF044468; K01, AF044469; K02, 
AF044470; K03, AF044471; L01, AF044472; L03, AF044473; L04, 
AF044474; L05, AF044475; M01, AF044476; M02, AF044477; M03, 
AF044478; N01, AF044479; N02, AF044480; O01, AF044481; O02, 
AF044482; O03, AF044483; P01, AF044484; Q01, AF044485; R01, 
AF044486; S01, AF044487. 
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